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GROUP B STREPTOCOCCUS POLYPEPTIDES NUCLEIC ACIDS AND 
THERAPEUTIC COMPOSITIONS AND VACCINES THEREOF 

5 

GOVERNMENTAL SUPPORT 

The research leading to the present invention was supported, at least in part, by a grant 
from NAID, Grant No.A140918. Accordingly, the Government may have certain 
10 rights in the invention. 

FIELD OF THE INVENTION 

This invention relates generally to extracellular matrix adhesin (Ema) proteins, 
15 antibodies thereto and to vaccines, compositions and therapeutics. The Group B 
streptococcal Ema polypeptides are EmaA, EmaB, EmaC, EmaD and EmaE. The 
invention further relates to Ema polypeptides from various species of bacteria, 
including S. pneumoniae^ S. pyogenes, E, faecalis and C diptheriae. The invention 
also relates to the identification and prevention of infections by streptococci. Isolated 
20 nucleic acids encoding Group B streptococcal Ema polypeptides, particularly EmaA, 
EmaB, EniaC, EmaD and EmaE and to other bacterial Ema homologs are included 
herein. Assays for compounds which modulate the polypeptides of the present 
invention for use in therapy are also provided. 

25 BACKGROUND OF THE INVENTION 

Streptococci are catalase negative gram positive cocci. They may be classified by the 
type of hemolysis exhibited on blood agar, by the serologic detection of carbohydrate 
antigens, or by certain biochemical reactions. Medically important streptococci include 
30 Groups A, B, D, S. pneumoniae and the viridans group of streptococci Lancefield 

type A (GroupA) Streptococcus pyogenes is an important human pathogen - the cause 
of streptococcal pharyngitis, impetigo and more severe infections such as bacteremia 
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and necrotizing fascitis. The immunologic sequelae of Group A Streptococcal 
infections are also important health problems - rheumatic carditis is the most common 
cause of acquired cardiac disease worldwide and post-streptococcal glomerulonephritis 
is a cause of hypertension and renal dysfunction. Group B Streptococcus agalactiae are 
5 the most common cause of serious bacterial infections in newborns, and important 
pathogens in pregnant women and nonpregnant adults with underlying medical 
problems such as diabetes and cardiovascular disease. Group D streptococci include 
the enterococci (Streptococcus faecalis and faecium) and the "nonenterococcal" Group 
D streptococci. Streptococcus pneumoniae (pneumococcus) is not classified by group 

10 in the Lancefield system. Pneumocogci are extremely important human pathogens, the 
most common cause of bacterial pneumonia, middle ear infections and meningitis 
beyond the newborn period. The viridans group of streptococci include S. milleri, S. 
mitis, S. sanguis and others. They cause bacteremia, endocarditis, and dental 
infections. Enterococci are important causes of urinary tract infections, bacteremia 

15 and wound infections (predominantly as nosocomial infections in hospitalized patients), 
and endocarditis. Over the past decade enterococci have developed resistance to many 
conventional antibiotics and there are some strains resistant to all known antibiotics. 

Group B streptococci (GBS) are the most common cause of serious bacterial disease 
20 in neonates, and are important pathogens in pregnant women and adults with 
underlying illnesses (Baker CJ. (2000) "Group B streptococcal infections" in 
Streptococcal infections. Clinical aspects, microbiology, and molecular 
pathogenesis, (D. L. Stevens and E. L. Kaplan), New York: Oxford University Press, 
222-237). Common manifestations of these infections include bacteremia, pneumonia, 
25 meningitis, endocarditis, and osteoarticular infections (Baker CJ. (2000) "Group B 
streptococcal infections" in Streptococcal infections. Clinical aspects, microbiology, 
and molecular pathogenesis. (D. L. Stevens and E. L. Kaplan), New York: Oxford 
University Press, 222-237; Blumberg H.M. et al. (1996) J Infect Dis 173:365-373). 
The incidence of invasive GBS disease is approximately 2.6 in 1000 live births and 7.7 
30 in 100,000 in the overall population, with mortality rates that vary from 6 to 30% 
(Baker CJ. (2000) "Group B streptococcal infections" \n Streptococcal infections. 
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Clinical aspects, microbiology, and molecular pathogenesis. (D. L. Stevens and E. L. 
Kaplan), New York; Oxford University Press, 222-237; Blumberg H.M. et al. (1996) J 
Infect Dis 173:365-373). Although much neonatal disease is preventable by 
administration of prophylactic antibiotics to women in labor, antibiotic prophylaxis 
5 programs can be inefficient, suffer from poor compliance, or fail if antibiotic resistance 
emerges. No effective prophylaxis strategy for adult infections has been established. 

During childbirth, GBS can pass from the mother to the newborn. By one estimate, up 
' to 30% of pregnant women carry GBS at least temporarily in the vagina or rectum 

10 without symptoms. Infants born to these women become colonized with GBS during 
delivery (Baker, C.J. and Edwards, M.S. (1995) "Group B Streptococcal Infections" in 
Infectious Disease of the Fetus and Newborn Infant (J.S. Remington and J.O Klein), 
980-1054). Aspiration of infected amniotic fluid or vaginal secretions allow GBS to 
gain access to the lungs. Adhesion to, and invasion of, respiratory epithelium and 

15 endothelium appear to be critical factors in early onset neonatal infection. (Baker, C. J. 
and Edwards, M.S. (1995) "Group B Streptococcal Infections" \n Infectious Disease 
of the Fetus and Newborn Infant (J.S. Remington and J.O Klein), 980-1054; Rubens, 
C.E. et al. (1991) JInfDis 164:320-330). Subsequent steps in infection, such as blood 
stream invasion and the establishment of metastatic local infections have not been 

20 clarified. The pathogenesis of neonatal infection occurring after the first week of life is 
also not well understood. Gastrointestinal colonization may be more important than a 
respiratory focus in late onset neonatal disease (Baker, C J. and Edwards, M.S. (1995) 
"Group B Streptococcal Infections" in Infectious Disease of the Fetus and Newborn 
Infant (J.S. Remington and J.O Klein), 980-1054). Considerable evidence suggests that 

25 invasion of brain microvascular endothelial cells by GBS is the initial step in the 
pathogenesis of meningitis. GBS are able to invade human brain microvascular 
endothelial cells and type III GBS, which are responsible for the majority of meningitis, 
accomplish this 2-6 times more eflEiciently than other serotypes (Nizet, V. et al. (1997) 
Infect Immun 65:5074-5081). 
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Because GBS is widely distributed among the population and is an important pathogen 
in newborns, pregnant women are commonly tested for GBS at 35-37 weeks of 
pregnancy. Much of GBS neonatal disease is preventable by administration of 
prophylactic antibiotics during labor to women who test positive or display known risk 
5 factors. However, these antibiotics programs do not prevent all GBS disease. The 
programs are deficient for a number of reasons. First, the programs can be inefficient. 
Second, it is difl&cult to ensure that all healthcare providers and patients comply with 
the testing and treatment. And finally, if new serotypes or antibiotic resistance 
emerges, the antibiotic programs may fail altogether. Currently available tests for GBS 
10 are inefficient. These tests may proyide false negatives. Furthermore, the tests are not 
specific to virulent strains of GBS. Thus, antibiotic treatment may be given 
unnecessarily and add to the problem of antibiotic resistance. Although a vaccine 
would be advantageous, none are yet commercially available. 

15 Traditionally, GBS are divided into 9 serotypes according to the immunologic 

reactivity of the polysaccharide capsule (Baker CJ. (2000) "Group B streptococcal 
infections" in Streptococcal infections. Clinical aspects, microbiology, and molecular 
pathogenesis, (D. L. Stevens and E. L. Kaplan), New York: Oxford University Press, 
222-237; Blumberg H.M. et al. (1996) J Infect Dis 173:365-373; Kogan, G. et al. 

20 (1996) J Biol Chem 271:8786-8790). Serotype III GBS are particularly important in 
human neonates, causing 60-70% of all infections and almost all meningitis (Baker CJ. 
(2000) "Group B streptococcal infections" in Streptococcal infections. Clinical 
aspects, microbiology, and molecular pathogenesis. (D. L. Stevens and E. L. Kaplan), 
New York: Oxford University Press, 222-237). Type III GBS can be subdivided into 

25 three groups of related strains based on the analysis of restriction digest pattems 

(RDPs) produced by digestion of chromosomal DNA with /f/wfif III and i&eSSST (I. Y. 
Nagano et al. (1991) JMedMicro 35:297-303; S. Takahashi et al. (1998) JInfDis 
177:1116-1119). 

30 Over 90% of invasive type III GBS neonatal disease in Tokyo, Japan and in Salt Lake 
City, Utah is caused by bacteria from one of three RDF types, termed RDF type III-3, 
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while RDP type 111-2 are significantly more likely to be isolated from vagina than from 
blood or CSF. These results suggest that this genetically-related cluster of t5^e III-3 
GBS are more virulent than 111-2 strains and could be responsible for the majority of 
invasive type III disease globally. 

5 

Preliminary vaccines for GBS used unconjuated purified polysaccaride. GBS poly - 
and oligosaccharides are poorly immunogenic and fail to elicit significant memory and 
booster responses. Baker et al immunized 40 pregnant women with purified serotype 
III capsular polysaccharide (Baker, CJ. et al. (1998) New Engl J of Med 

10 319: 1 1 80- 1 1 85). Overall, only 57% of women with low levels of specific antibody 

responded to the vaccine. The poor immunogenicity of purified polysaccharide antigen 
was fiirther demonstrated in a study in which thirty adult volunteers were immunized 
with a tetravalent vaccine composed of purified polysaccharide fi-om serotypes la, lb, 
II, and III (Kotloff, KX. et al (1996) Vaccine 14:446-450). Although safe, this 

15 vaccine was only modestly immunogenic, with only 13% of subjects responding to 
type lb, 17% to type II, 33% responding to type la, and 70% responding to type III 
polysaccharide. The poor immunogenicity of polysaccharide antigens prompted efforts 
to develop polysaccharide conjugate vaccines, whereby these poly - or 
oligosaccharides are conjugated to protein carriers. Ninety percent of healthy adult 

20 women immunized with a type III polysaccharide-tetanus toxoid conjugate vaccine 
responded with a 4-fold rise in antibody concentration, compared to 50% immunized 
with plain polysaccharide (Kasper, D.L, et al (1996) J of Clin Invest 98:2308-23 14). 
A type la/Ib polysaccharide-tetanus toxoid conjugate vaccine was similarly more 
immunogenic in healthy adults than plain polysaccharide (Baker, C.J. et al (1999) J 

25 Infect Dis 179:142-150). 

The disadvantage of polysaccharide-protein conjugate vaccines is that the process of 
purifying and conjugating polysaccharides is difficult, time-consuming and expensive. 
A protein antigen which could be cheaply and easily produced would be an 
30 improvement. 



wo 02/12294 



PCT/USOl/24795 



6 

If one were to make a polysaccharide-protein conjugate vaccine, a GBS-specific 
carrier protein may be preferable to one of the commonly used carriers such as tetanus 
or diphtheria toxoids because of the potential problems associated with some of these 
carrier proteins, particularly variable immunogenicity and the problems associated with 
5 repeated vaccination with the same carrier protein. Selection of appropriate carrier 
proteins is important for the development of polysaccharide-protein vaccine 
formulations. For example, Haemophilus influenzae type b poly- or oligosaccharide 
conjugated to different protein carriers has variable immunogenicity and elicits 
antibody with varying avidity (Decker, M.D. et al (1992) J Pediatrics 120: 184-189; 

10 Schlesinger, Y. (1992) JAMA 267: 1489-1494). Repeated immunization with the same 
carrier protein may also suppress immune responses by competition for specific B cells 
(epitopic suppression) or other mechanisms. This is of particular concern for the 
development of GBS vaccines since recently developed poly/oligosaccharide-protein 
conjugate vaccines against the bacteria H, influenzae^ S, pneumoniae^ and A^. 

15 meningitidis all utilize a restricted number of carrier proteins (tetanus toxoid, 

CRM 197, diptheria toxoid), increasing the number of exposures to these carriers an 
individual is likely to receive. Additionally, using tetanus as a carrier protein offers no 
specific advantage beyond the improved inmiunogenicity of the vaccine. A 
second-generation vaccine containing a GBS-specific carrier protein would enhance 

20 immunogenicity and have an advantage in that a GBS-specific immune response would 
be generated against both the carrier protein and the poly/oligosaccharide. 

Therefore, in view of the aforementioned deficiencies attendant with prior art vaccines 
and methods, it should be apparent that there still exists a need in the art for an 

25 effective and immunogenic GBS vaccine. The availability and use of a GBS 

polypeptide in a conjugate vaccine is desirable. A GBS polypeptide which is present 
or expressed in all GBS serotypes would have the added advantage of providing broad, 
general immunity across many GBS serotypes. It would be particularly relevant and 
useful to provide a streptococcal vaccine or immunogen which is expressed broadly in 

30 various streptococcal species, whereby broad or general immunity against multiple and 
unique groups of streptococci (for instance. Group A, Group B and S, pneumoniae). 



wo 02/12294 



PCT/USOl/24795 



7 

particularly against distinct virulent and clinically relevant streptococcal bacteria, could 
thereby be generated. 

The citation of references herein shall not be construed as an admission that such is 
5 prior art to the present invention. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, streptococcal polypeptides termed 
10 extracellular matrix adhesins (Ema) ^e provided which are particularly useful in the 
identification and prevention of infections by streptococci. 

In its broadest aspect, the present invention encompasses isolated polypeptides 
comprising an amino acid sequence of a streptococcal polypeptide selected from the . 
15 group of EmaA, EmaB, EmaC, EmaD and EmaE. The isolated peptides, including 
combinations of one or more thereof, are suitable for use in immunizing animals and 
humans against bacterial infection, particularly streptococci. 

The present invention is directed to an isolated streptococcal EmaA polypeptide which 
20 comprises the amino acid sequence set out in SEQ ID NO: 2, and analogs, variants and 
immunogenic fragments thereof. 

The present invention is directed to an isolated streptococcal EmaB polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 4, and analogs, variants and 
25 immunogenic fragments thereof . 

The present invention is directed to an isolated streptococcal EmaC polypeptide which 
comprises the amino acid sequence set out in SEQ ED NO: 6, and analogs, variants and 
immunogenic fi-agments thereof 

30 
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The present invention is directed to an isolated streptococcal EmaD polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 8, and analogs, variants and 
immunogenic fragments thereof 

5 The present invention is directed to an isolated streptococcal EmaE polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 10, and analogs, variants 
and immunogenic fragments thereof 

The present invention also provides Ema polypeptide homologs from distinct bacterial 
10 species, particularly including distinct streptococcal species, more particularly 

including Group B streptococcus, Group A streptococcus (particularly S, pyogenes) 
and iS. pneumoniae. The present invention also provides Ema polypeptides from 
additional distinct bacterial species, particularly including Enterococcus faecalis and 
Corynebacterium diptheriae. Nucleic acids encoding Ema polypeptide homologs from 
15 distinct bacterial species are also provided. 

The present invention thus provides an isolated streptococcal Ema polypeptide 
comprising the amino acid sequence set out in SEQ ID NO:23. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:23 is further 
20 provided. 

The invention thus further provides an isolated streptococcal Ema polypeptide 
comprising the amino acid sequence set out in SEQ ID NO:26. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:26 is further 
25 provided. 

The present invention further provides an isolated streptococcal Ema polypeptide 
comprising the amino acid sequence set out in SEQ ED NO:37. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ED NO:37 is further 
30 provided. 
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An enterococcal Ema polypeptide is further provided comprising the amino acid 
sequence set out in SEQ ID NO:29. An isolated isolated nucleic acid which encodes 
the enterococcal polypeptide set out in SEQ ID NO:29 is also provided. 

5 The invention provides an isolated Corynebacterium Ema polypeptide comprising the 
amino acid sequence set out in SEQ ID NO: 32. Also provided is an isolated nucleic 
acid which encodes the Corynebacterium polypeptide set out in SEQ ID NO: 32. 

The invention provides an isolated bacterial polypeptide comprising the amino acid 
10 sequence TLLTCTPYMINS/THRL;.VR/KG (SEQ ID NO: 34), wherein the 
polypeptide is not isolated from Actinomyces. 

The invention further provides an isolated streptococcal polypeptide comprising the 
amino acid sequence TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34). 

15 

Also provided is an isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35). 

The present invention includes an isolated bacterial polypeptide comprising the amino 
20 acid sequence TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). An isolated 
streptococcal polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36) is also provided. 

The invention further includes an isolated polypeptide having the amino acid sequence 
25 selected from the group of TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), 

TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35), and TLVTCTPYGVNTKRLLVRG 
(SEQ ID NO: 36). 

The present invention contemplates the use of the polypeptides of the present invention 
30 in diagnostic tests and methods for determining and/or monitoring of streptococcal 
infection. Thus, the present invention provides an isolated Ema pols^eptide. 
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particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
labeled with a detectable label. 

In the instance where a radioactive label, such as the isotopes ^H, ^'^C, ^^P, ^^S, ^^Cl, 
5 ^^Cr, ^^Co, ^^Co, ^^Fe, ^°Y, ^^^I, ^^^I, and ^^^Re are used, known currently available 
counting procedures may be utilized. In the instance where the label is an enzyme, 
detection may be accomplished by any of the presently utilized colorimetric, 
spectrophotometric, fluorospectrophotometric, amperometric or gasometric techniques 
known in the art, 

10 

The present invention extends to an immunogenic Ema polypeptide, particularly 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, or a fragment 
thereof The present invention also extends to immunogenic Ema polypeptides 
wherein such polypeptides comprise a combination of at least one immunogenic Ema 
15 polypeptide, selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, or 
immunogenic polypeptide fragment thereof, and a GBS polypeptide selected from the 
group of Spbl, Spb2, C protein alpha antigen. Rib, Lmb, C5a-ase, or immunogenic 
fragments thereof 

20 In a fiirther aspect, the present invention extends to vaccines based on the Ema 

proteins described herein. The present invention provides a vaccine comprising one or 
more streptococcal polypeptides selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable adjuvant. The present invention 
provides a vaccine comprising one or more streptococcal polypeptides selected from 

25 the group of the polypeptide of SEQ ID NO: 23, 26, and 37, and a pharmaceutically 
acceptable adjuvant. 

The present invention further provides a streptococcal vaccine comprising one or more 
Group B streptococcal polypeptides selected from the group of EmaA, EmaB, EmaC, 
30 EmaD and EmaE, further comprising one or more additional streptococcal antigens. 
The present invention further provides a GBS vaccine comprising one or more Group 
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B streptococcal polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE, further comprising one or more additional GBS antigens. In a particular 
embodiment, the GBS antigen is selected from the group of the polypeptide Spbl or an 
immunogenic fragment thereof, the polypeptide Spb2 or an immunogenic fragment 
5 thereof, C protein alpha antigen or an immunogenic fragment thereof. Rib or an 

immunogenic fragment thereof Lmb or an immunogenic fragment thereof, C5a-ase or 
an immunogenic fragment thereof and Group B streptococcal polysaccharides or 
oligosaccharides. 

10 In another aspect, the invention is directed to a vaccine for protection of an animal 

subject from infection with streptococci comprising an immunogenic amount of one or 
more Ema polypeptide EmaA, EmaB, EmaC, EmaD or EmaE, or a derivative or 
fragment thereof Such a vaccine may contain the protein conjugated covalently to a 
GBS bacterial polysaccharide or oligosaccharide or polysaccharide or oligosaccharide 

15 from one or more GBS serotypes. 

In a still further aspect, the present invention provides an immunogenic composition 
comprising one of more streptococcal polypeptides selected from the group of EmaA, 
EmaB, EmaC, EmaD and EmaE, and a pharmaceutically acceptable adjuvant. 

20 

The present invention further provides an immunogenic composition comprising one or 
more Group B streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, further comprising one or more antigens selected from the 
group of the polypeptide Spbl or an immunogenic fragment thereof, the polypeptide 
25 Spb2 or an immunogenic fragment thereof, C protein alpha antigen or an immunogenic 
fragment thereof, Rib or an immunogenic fragment thereof Lmb or an immunogenic 
fragment thereof, C5a-ase or an immunogenic fragment thereof, and Group B 
streptococcal polysaccharides or oligosaccharides. 

30 The invention further provides pharmaceutical compositions, vaccines, and diagnostic 
and therapeutic methods of use thereof 
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The invention provides pharmaceutical compositions comprising a bacterial Ema 
polypeptide and a pharmaceutically acceptable carrier. The invention provides 
pharmaceutical compositions comprising a streptococcal polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE, the polypeptide of SEQ ED NO:23, 
5 the polypeptide of SEQ ID NO: 26, the polypeptide of SEQ ID NO:37, and a 
pharmaceutically acceptable carrier. The invention provides pharmaceutical 
compositions comprising a streptococcal polypeptide selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE, and a pharmaceutically acceptable carrier. 
The present invention further provides pharmaceutical compositions comprising one or 
10 more GBS Ema polypeptide, or a fragment thereof, in combination with one or more 
of GBS polypeptide Spbl, Spb2, C protein alpha antigen. Rib, Lmb, C5a-ase, a Group 
B streptococcal polysaccharide or oligosaccharide vaccine, and an anti-streptococcal 
vaccine. 

15 In a still further aspect, the present invention provides a purified antibody to a 

streptococcal polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE. In a still further aspect, the present invention provides a purified antibody to a 
streptococcal polypeptide selected from the group of the polypeptide of SEQ ID 
NO:23, the polypeptide of SEQ ID NO: 26, and the polypeptide of SEQ ID NO:37. 

20 

Antibodies against the isolated polypeptides of the present invention include naturally 
raised and recombinantly prepared antibodies. These may include both polyclonal and 
monoclonal antibodies prepared by known genetic techniques, as well as bi-specific 
(chimeric) antibodies, and antibodies including other functionalities suiting them for 

25 diagnostic use. Such antibodies can be used in immunoassays to diagnose infection 
with a particular strain or species of bacteria. The antibodies can also be used for 
passive immunization to treat an infection with streptococcal bacteria including Group 
B streptococcus. Group A streptococcus, and S. pneumoniae. These antibodies may 
also be suitable for modulating bacterial adherence and/or invasion including but not 

30 limited to acting as competitive agents. 
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The present invention provides a monoclonal antibody to a streptococcal polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. The invention 
thereby extends to an immortal cell line that produces a monoclonal antibody to a 
streptococcal poypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
5 EmaE. 

An antibody to a streptococcal Ema polypeptide EmaA, EmaB, EmaC,. EmaD or 

EmaE labeled with a detectable label is further provided. In particular embodiments, 

the label may selected from the group consisting of an enzyme, a chemical which 

10 fluoresces, and a radioactive element. 

« 

The present invention provides a pharmaceutical composition comprising one or more 
antibodies to a streptococcal protein selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier. The invention further 

15 provides a pharmaceutical composition comprising a combination of at least two 
antibodies to Group B streptococcal proteins and a pharmaceutically acceptable 
carrier, wherein at least one antibody to a protein selected from the group of EmaA, 
EmaB, EmaC, EmaD, and EmaE is combined with at least one antibody to a protein 
selected from the group of Spbl, Spb2, Rib, Lmb, C5a-ase and a C protein alpha 

20 antigen. . 

The present invention also relates to isolated nucleic acids, such as recombinant DNA 
molecules or cloned genes, or degenerate variants thereof, mutants, analogs, or 
fragments thereof, which encode the isolated polypeptide of the present invention or 

25 which competitively inhibit the activity of the polypeptide. The present invention 

further relates to isolated nucleic acids, such as recombinant DNA molecules or cloned 
genes, or degenerate variants thereof, mutants, analogs, or fragments thereof, which 
encode a bacterial Ema polypeptide. The present invention further relates to isolated 
nucleic acids, such as recombinant DNA molecules or cloned genes, or degenerate 

30 variants thereof, mutants, analogs, or fragments thereof, which encode a streptococcal 
Ema polypeptide. The present invention further relates to isolated nucleic acids, such 
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as recombinant DNA molecules or cloned genes, or degenerate variants thereof, 
mutants, analogs, or fragments thereof, which encode a streptococcal Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE. Preferably, the isolated nucleic acid, which includes degenerates, variants, 
5 mutants, analogs, or fragments thereof, has a sequence as set forth in SEQ ID NOS: 1, 
3, 5, 7 or 9. In a further embodiment of the invention, the DNA sequence of the 
recombinant DNA molecule or cloned gene may be operatively linked to an expression 
control sequence which may be introduced into an appropriate host. The invention 
accordingly extends to unicellular hosts transformed with the cloned gene or 
10 recombinant DNA molecule comprising a DNA sequence encoding an Ema protein, 
particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and 
more particularly, the DNA sequences or fragments thereof determined from the 
sequences set forth above. 

15 In a particular embodiment, the nucleic acid encoding the EmaA polypeptide has the 
sequence selected from the group comprising SEQ ID NO:l; a sequence that 
hybridizes to SEQ ID NO: 1 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 1 or 
a sequence that hybridizes to SEQ ID NO: 1 under moderate stringency hybridization 

20 conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaA polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 1; a sequence 
complementary to SEQ ID NO: 1 ; or a homologous sequence which is substantially 
similar to SEQ ID NO: 1 . In a fiirther embodiment, the nucleic acid has the sequence 

25 consisting of SEQ ID NO: 1 . 

In a particular embodiment, the nucleic acid encoding the EmaB polypeptide has the 
sequence selected from the group comprising SEQ ID NO:3; a sequence that 
hybridizes to SEQ ID NO:3 under moderate stringency hybridization conditions; DNA 
30 sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 3 or 
a sequence that hybridizes to SEQ ID NO:3 under moderate stringency hybridization 
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conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaB polypeptide 
has the sequence selected from the group comprising SEQ ID NO:3; a sequence 
complementary to SEQ ID NO:3; or a homologous sequence which is substantially 
5 similar to SEQ ID NO:3. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:3. 

In a particular embodiment, the nucleic acid encoding the EmaC polypeptide has the 
sequence selected from the group comprising SEQ ID NO:5; a sequence that 
hybridizes to SEQ ID NO: 5 under nioderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 5 or 
a sequence that hybridizes to SEQ ID NO: 5 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaC polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 5; a sequence 
complementary to SEQ ID NO: 5; or a homologous sequence which is substantially 
similar to SEQ ID NO:5. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:5. 

f20 In a particular embodiment, the nucleic acid encoding the EmaD polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 7; a sequence that 
hybridizes to SEQ ID NO: 7 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 7 or 
a sequence that hybridizes to SEQ ID NO: 7 under moderate stringency hybridization 

25 conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaD polypeptide 
has the sequence selected from the group comprising SEQ ID NO:7; a sequence 
complementary to SEQ ED NO:7; or a homologous sequence which is substantially 
similar to SEQ ID NO:7. In a further embodiment, the nucleic acid has the sequence 

30 consisting of SEQ ID NO:7. 



10 



15 
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In a particular embodiment, the nucleic acid encoding the EmaE polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 9; a sequence that 
hybridizes to SEQ ID NO: 9 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO:9 or 
5 a sequence that hybridizes to SEQ ED NO: 9 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaE polypeptide 
has the sequence selected from the group comprising SEQ ID NO:9; a sequence 
complementary to SEQ ID NO:9; or a homologous sequence which is substantially 
10 similar to SEQ ID NO:9 In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:9. 

In a further embodiment, the nucleic acid encoding the bacterial Ema polypeptide 
comprises the sequence selected from the group comprising SEQ ID NO: 24, 27, 30 
15 and 33. In a further embodiment, the nucleic acid encoding the bacterial Ema 

polypeptide has the sequence selected from the group comprising SEQ ID NO: 24, 27, 
30 and 33. 

A nucleic acid capable of encoding a streptococcal polypeptide EmaA, EmaB, EmaC, 
20 EmaD or EmaE which is a recombinant DNA molecule is further provided. Such a 
recombinant DNA molecule wherein the DNA molecule is operatively linked to an 
expression control sequence is also provided herein. 

- The present invention relates to nucleic acid vaccines or DNA vaccines comprising 
25 nucleic acids encoding immunogenic streptococcal Ema polypeptides, particularly 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. The present 
invention relates to nucleic acid vaccines or DNA vaccines comprising nucleic acids 
encoding one or more immunogenic Ema polypeptide or a fragment thereof or any 
combination of one or more Ema polypeptide EmaA, EmaB, EmaC, EmaD or EmaE 
30 with at least one other polypeptide, particularly a GBS polypeptide, more particularly 
wherein said other GBS polypeptide is selected fi*om the group of Spbl, Spb2, C 
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protein alpha antigen. Rib, Lmb, C5a-ase, and immunogenic polypeptide fragments 
thereof. 

The invention further relates to a vaccine for protection of an animal subject from 
5 infection with a streptococcal bacterium comprising a vector containing a gene 
encoding an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE operatively associated with a promoter capable of directing 
expression of the gene in the subject. The present invention further provides a nucleic 
acid vaccine comprising a recombinant DNA molecule capable of encoding a GBS 
10 polypeptide EmaA, EmaB, EmaC, EmaD or EmaE. 

The invention further relates to a vaccine for protection of ah animal subject from 
infection with a Group B streptococcal bacterium comprising a vector containing a 
gene encoding an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
15 EmaD and EmaE operatively associated with a promoter capable of directing 

expression of the gene in the subject. The present invention further provides a nucleic 
acid vaccine comprising a recombinant DNA molecule capable of encoding a GBS 
polypeptide EmaA, EmaB, EmaC, EmaD or EmaE. 

20 The present invention provides a vector which comprises the nucleic acid capable of 
encoding encoding an Ema polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE and a promoter. The present invention provides a vector 
which comprises the nucleic acid of any of SEQ ID NO: 1, 3, 5, 7 or 9 and a 
promoter. The invention contemplates a vector wherein the promoter comprises a 

25 bacterial, yeast, insect or mammalian promoter. The invention contemplates a vector 
wherein the vector is a plasmid, cosmid, yeast artificial chromosome (YAC), 
bacteriophage or eukaryotic viral DNA. 



30 



The present invention fiirther provides a host vector system for the production of a 
polypeptide which comprises the vector capable of encoding an Ema polypeptide, 
particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE in a 
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suitable host cell. A host vector system is provided wherein the suitable host cell 
comprises a prokaryotic or eukaryotic cell. A xuiicellular host transformed with a 
recombinant DNA molecule or vector capable of encoding encoding an Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE is 
5 thereby provided. 

The present invention includes methods for determining and monitoring infection by 
streptococci by detecting the presence of a streptococcal polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE. In a particular such method, the 
10 streptococcal Ema polypeptide is measured by: 

a, contacting a sample in which the presence or activity of a 
Streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE is suspected with an antibody to the said 
streptococcal polypeptide under conditions that allow binding of the 

15 streptococcal polypeptide to the antibody to occur; and 

b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 

20 wherein the detection of binding indicates the presence or activity of the streptococcal 
polypeptide in the sample. 

The present invention includes methods for determining and monitoring infection by 
streptococci by detecting the presence of a streptococcal polypeptide selected from the 
25 group of EmaA, EmaB, EmaC, EmaD and EmaE. In a particular such method, the 
streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a 

Streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE is suspected with an antibody to the said 
30 streptococcal polypeptide under conditions that allow binding of the 

streptococcal polypeptide to the antibody to occur; and 
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b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 

wherein the detection of binding indicates the presence or activity of the 
5 streptococcal polypeptide in the sample. 

The present invention includes methods for determining and monitoring infection by 
Group B streptococci by detecting the presence of a Group B streptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. In a 
10 particular such method, the streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a Group B 
streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE is suspected with an antibody to the said 
Group B streptococcal polypeptide under conditions that allow binding 

15 of the Group B streptococcal polypeptide to the antibody to occur; and 

b. detecting whether binding has occurred between the Group B 
streptococcal polypeptide from the sample and the antibody; 

20 

wherein the detection of binding indicates the presence or activity of the Group B 
streptococcal polypeptide in the sample. 

The present invention further provides a method for detecting the presence of a 
25 bacterium having a gene encoding a streptococcal polypeptide selected from the group 
of emoA, emaB, emaC, emaD and emaE, comprising: 

a, contacting a sample in which the presence or activity of the bacterium 
is suspected with an oligonucleotide which hybridizes to a 
streptococcal polypeptide gene selected from the group of emoA, 
30 emaB, emaC, emaD and emaE, under conditions that allow specific 

hybridization of the oligonucleotide to the gene to occur; and 
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b. detecting whether hybridization has occurred between the 
oligonucleotide and the gene; 

wherein the detection of hybridization indicates that presence or activity of the 
5 bacterium in the sample. 

The invention includes an assay system for screening of potential compounds effective 
to modulate the activity of a streptococcal protein EmaA, EmaB, EmaC, EmaD or 
EmaE of the present invention. In one instance, the test compound, or an extract 

10 containing the compound, could be administered to a cellular sample expressing the 
particular Ema protein to determine the compound's effect upon the activity of the 
protein by comparison with a control. In a further instance the test compound, or an 
extract containing the compound, could be administered to a cellular sample 
expressing the Ema protein to determine the compound's effect upon the activity of 

15 the protein, and thereby on adherence of said cellular sample to host cells, by 
comparison with a control. 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of streptococci, 
20 so as to treat or prevent the adverse consequences of invasive, spontaneous, or 
idiopathic pathological states. 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of Group B 
streptococci, so as to treat or prevent the adverse consequences of invasive, 
25 spontaneous, or idiopathic pathological states. 

The invention provides a method for preventing infection with a bacterium that 
expresses a streptococcal Ema polypeptide comprising administering an 
immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
30 from the group of EmaA, EmaB, EmaC, EmaD and EmaE to a subject. 
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The invention further provides a method for preventing infection with a bacterium that 
expresses a Group B streptococcal Ema polypeptide comprising administering an 
immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
from the group of EmaA, EmaB, EmaC, EmaD and EmaE to a subject. 

5 

The present invention is directed to a method for treating infection with a bacterium 
that expresses a streptococcal Ema polypeptide comprising administering a 
therapeutically effective dose of a pharmaceutical composition comprising an Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a 
10 pharmaceutically acceptable carrier to a subject. 



The invention further provides a method for treating infection with a bacterium that 
expresses a streptococcal Ema polypeptide comprising administering a therapeutically 
effective dose of a pharmaceutical composition comprising an antibody to an Ema 
15 polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a 
pharmaceutically acceptable carrier to a subject. 



In a further aspect, the invention provides a method of inducing an immune response 
in a subject which has been exposed to or infected with a streptococcal bacterium 
20 comprising administering to the subject an amount of the pharmaceutical composition 
comprising an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier, thereby inducing an 
immune response. 



25 The invention still further provides a method for preventing infection by a 

streptococcal bacterium in a subject comprising administering to the subject an amount 
of a pharmaceutical composition comprising an antibody to an Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE and a 
pharmaceutically acceptable carrier or diluent, thereby preventing infection by a 

30 streptococcal bacterium. 
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In a further aspect, the invention provides a method of inducing an immune response 
in a subject v^hich has been exposed to or infected with a Group B streptococcal 
bacterium comprising administering to the subject an amount of the pharmaceutical 
composition comprising an Ema polypeptide selected from the group of EmaA, EmaB, 
5 EmaC, EmaD and EmaE, and a pharmaceutically acceptable carrier, thereby inducing 
an immune response. 

The invention still further provides a method for preventing infection by a Group B 
streptococcal bacterium in a subject comprising administering to the subject an amount 
10 of a pharmaceutical composition comprising an antibody to an Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE and a 
pharmaceutically acceptable carrier or diluent, thereby preventing infection by a 
streptococcal bacterium, 

15 The invention further provides an ema mutant bacteria which is non-adherent and/or 
non-invasive to cells, particularly which is mutated in one or more genes selected from 
the group of emoA, emaB, emaC, emaD and emaE. Particularly, such ema mutant is a 
streptococcal bacteria. More particularly, such ema mutant is a Group B 
streptococcal bacteria. Such non-adherent and/or non-invasive ema mutant bacteria 

20 can further be utilized in expressing other immunogenic or therapeutic proteins for the 
purposes of eliciting immune responses to any such other proteins in the context of 
vaccines and in other forms of therapy. 

Other objects and advantages will become apparent to those skilled in the art from a 
25 review of the following description which proceeds with reference to the following 
illustrative drawings. 

BRIEF DESCRIPTTON OF THE D RAWINGS 



30 



FIGURE 1 depicts the restriction digest pattern (RDP) type 111-3 specific probes. 
Dot blot hybridization of probe DYl-1 with genomic DNA isolated from type III 
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GBS. 10 ug of genomic DNA from each of 62 type III GBS strains was transferred to 
nylon membrane. Radiolabeled probe DYl-1 hybridized with DNA from all 111-3 
strains (rows A-D) including the original type III-3 strain (well E-1). The probe failed 
to hybridize with DNA from III-2 strains (Fl- FIO, Gl-7) including the original strain 
5 used in the subtraction hybridization (well E 10) and IIH strains (wells HI -3; cf 
Figure 3). The same pattern of hybridization was observed using probe DYl-1 1 . 

FIGURE 2 depicts the nucleic acid and predicted amino acid sequence of emaA. 

10 FIGURE 3 depicts the nucleic acid ^nd predicted amino acid sequence ofemaB. 

FIGURE 4 depicts the nucleic acid and predicted amino acid sequence of emaC, 

FIGURE 5 depicts the nucleic acid and predicted amino acid sequence of emaD. 

15 

FIGURE 6 A-D depicts the nucleic acid and predicted amino acid sequence of emaE. 

DETAILED DESCRIPTION 

20 The present invention provides novel Group B streptococcal Ema polypeptides and 
their Ema homologs in distinct bacterial species, including distinct streptococcal 
species. The present invention relates to novel streptococcal Ema polypeptides, 
particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and 
fragments thereof Nucleic acids encoding Ema polypeptides, and diagnostic and 

25 therapeutic compositions and methods based thereon for identification and prevention 
of infections by virulent forms of streptococci are provided. In particular, the present 
invention includes Group B streptococcal Ema polypeptides. The invention fiirther 
includes polypeptide homologs of the GBS Ema polypeptides, particularly 
streptococcal homologs, more particularly Ema homologs of S. pneumoniae and S, 

30 pyogenes. Bacterial Ema polypeptide homologs in E, faecalis and C diptheriae are 
also provided. 
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Polypeptides 

The present invention is directed to an isolated polypeptide comprising an amino acid 
5 sequence of a bacterial Ema polypeptide. Bacterial Ema polypepties are provided 
from streptococcus, enterococcus and corynebacterium. The present invention is 
particularly directed to an isolated polypeptide comprising an amino acid sequence of a 
streptococcal Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE. The present invention is particularly directed to an isolated 
10 polypeptide comprising an amino acid sequence of a Group streptococcal Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. 
Additional S. pneumoniae and S. pyogenes Ema polypeptides are included in the 
invention. E. faecalis and C. diptheriae Ema polypeptides are also included in the 
invention. 

15 

The polypeptides of the present invention are suitable for use in immunizing animals 
broadly against streptococcal infection. The polypeptides of the present invention are 
suitable for use in Ltnmunizing animals broadly against Group B, Group A, and S. 
pneumoniae streptococcal infection. The polypeptides of the present invention are 
20 suitable for use in inununizing animals against Group B streptococci. These 
polypeptide or peptide fragments thereof, when formulated with an appropriate 
adjuvant, are used in vaccines for protection against streptococci, particularly Group B 
streptococci, and against other bacteria with cross-reactive proteins. 

25 GBS proteins with streptococcal homologs outside of Group B have been previously 
identified (Lachenauer CS and MadofFLC (1991) Adv Exp Med BioL 418:615-8; 
Brady L.J. et al (1991) Infect Immun 59(12):4425-35; Stahlhammer-Carlemalm M. et 
al (2000) J Infect Dis 182(1): 142-129). Stahlhammer-Carlemalm et al have 
demonstrated cross-protection between Group A and Group B streptococci due to 

30 cross-reacting surface proteins (Stahlhammer-Carlemalm M. et al (2000) J Infect Dis 
182(1): 142-129). The R28 protein of group A streptococcus (GAS) and the Rib 
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protein of group B streptococcus (GBS) are surface molecules that elicit protective 
immunity to experimental infection. These proteins are members of the same family 
and cross-react immunologically. In spite of extensive amino acid residue identity, the 
cross-reactivity between R28 and Rib was found to be limited, as shown by analysis 
5 with highly purified proteins and specific antisera. Nevertheless, immunization of mice 
with purified R28 conferred protection against lethal infection with Rib-expressing 
GBS strains, and immunization with Rib conferred protection against R28-expressing 
GAS. Thus, R28 and Rib elicited cross-protective immunity. 

10 The present invention is directed to an isolated streptococcal EmaA polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 2, and analogs, variants 
and immunogenic firagments thereof 

The present invention is directed to an isolated streptococcal EmaB polypeptide which 
15 comprises the amino acid sequence set out in SEQ ID NO: 4, and analogs, variants 
and immunogenic fragments thereof 

The present invention is directed to an isolated streptococcal EnoiaC polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 6, and analogs, variants 
20 and immunogenic fragments thereof 

The present invention is directed to an isolated streptococcal EmaD polypeptide which 
comprises the amino acid sequence set out in SEQ ID NO: 8, and analogs, variants 
and immunogenic fragments thereof 

25 

The identity or location of one or more amino acid residues may be changed or 
modified to include variants such as, for example, deletions containing less than all of 
the residues specified for the protein, substitutions wherein one or more residues 
specified are replaced by other residues and additions wherein one or more amino acid 
30 residues are added to a terminal or medial portion of the polypeptide. These 

molecules include: the incorporation of codons "preferred" for expression by selected 
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non-mammalian hosts; the provision of sites for cleavage by restriction endonuclease 
enzymes; and the provision of additional initial, terminal or intermediate DNA 
sequences that facilitate construction of readily expressed vectors. 

5 The present invention is directed to an isolated Group B streptococcal EmaE 

polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 10, and 
analogs, variants and immunogenic fragments thereof 

The present invention thus provides an isolated streptococcal Ema polypeptide 
10 comprising the amino acid sequence, set out in SEQ ID NO:23. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:23 is further 
provided. 

The invention thus further provides an isolated streptococcal Ema polypeptide 
15 comprising the amino acid sequence set out in SEQ ID NO:26. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:26 is further 
provided. 

The present invention further provides an isolated streptococcal Ema polypeptide 
20 comprising the amino acid sequence set out in SEQ ID NO:37. An isolated nucleic 
acid which encodes the streptococcal polypeptide set out in SEQ ID NO:37 is further 
provided. 

An enterococcal Ema polypeptide is further provided comprising the amino acid 
25 sequence set out in SEQ ID NO:29. An isolated isolated nucleic acid which encodes 
the enterococcal polypeptide set out in SEQ ID NO;29 is also provided. 

The invention provides an isolated Corynebacterhim Ema polypeptide comprising the 
amino acid sequence set out in SEQ ID NO: 32. Also provided is an isolated nucleic 
30 acid which encodes the Corynebacterium polypeptide set out in SEQ ID NO: 32. 
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The invention provides an isolated bacterial polypeptide comprising the an^no acid 
sequence TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), wherein the 
polypeptide is not isolated &om Actinomyces. 



5 The invention further provides an isolated streptococcal polypeptide comprising the 
amino acid sequence TLLTCTPYMINS/THRLLVR/KG (SEQ ED NO: 34). 



Also provided is an isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35). 

10 

4 

The present invention includes an isolated bacterial polypeptide comprising the amino 
acid sequence TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). An isolated 
streptococcal polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36) is also provided. 

15 

The invention further includes an isolated polypeptide having the amino acid sequence 
selected from the group of TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35), and TLVTCTPYGVNTKRLLVRG 
(SEQ ID NO: 36). 

20 

The present invention contemplates the use of the streptococcal polypeptides of the 
present invention in diagnostic tests and methods for determining and/or monitoring of 
streptococcal infection. Thus, the present invention provides an isolated GBS Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
25 BmaE, labeled with a detectable label. 

In the instance where a radioactive label, such as the isotopes ^H, ^'^C, ^^P, ^^S, ^^Cl, 
^^Cr, ""Co, ^»Co, '^e, ^°Y, "^I, and "^Re are used, known currently available 
counting procedures may be utilized. In the instance where the label is an enzyme, 
30 detection may be accomplished by any of the presently utilized colorimetric, 
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spectrophotometric, fluorospectrophotometric, amperometric or gasometric 
techniques known in the art. 

The present invention extends to an immunogenic bacterial Ema polypeptide. The 
5 present invention extends to an immunogenic streptococcal Ema polypeptide, 

particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, or a 
fragment thereof The present invention also extends to immunogenic GBS Ema 
polypeptides wherein such polypeptides comprise a combination of at least one 
immunogenic GBS Ema polypeptide, selected from the group of EmaA, EmaB, EmaC, 
10 EmaD and EmaE, or immunogenic polypeptide fragment thereof and GBS polypeptide 
Spbl, Spb2, C protein alpha antigen, Rib or immunogenic fragments thereof 

As defined herein, "adhesion" means noncovalent binding of a bacteria to a human cell 
or secretion that is stable enough to withstand washing. 

15 

The term "extracellular matrix adhesin", "Ema", "ema" and any variants not specifically 
listed, may be used herein interchangeably, and as used throughout the present 
application and claims refer to proteinaceous material including single or multiple 
proteins, and extends to those proteins having the amino acid sequence data described 

20 herein and particularly identified by (SEQ ID NOS: 2, 4, 6, 8, 10, 23, 26, 29, 32 and 
37), and the profile of activities set forth herein and in the Claims. In particular the 
Ema proteins provided herein include EmaA, EmaB, EmaC, EmaD and EmaE. The 
Ema proteins include bacterial Ema homologs. Bacterial Ema homologs include those 
from streptococcal species and other bacterial species. Accordingly, proteins and 

25 polypeptides displaying substantially equivalent or altered activity are likewise 
contemplated. These modifications may be deliberate, for example, such as 
modifications obtained through site-directed mutagenesis, or may be accidental, such 
as those obtained through mutations in hosts that are producers of one or more Ema 
polypeptide. Also, the term "extracellular matrix adhesin (Ema)" is intended to include 

30 within its scope proteins specifically recited herein as well as all substantially 
homologous analogs and allelic variations. 
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This invention provides an isolated immunogenic polypeptide comprising an amino 
acid sequence of a bacterial Ema polypeptide. This invention provides an isolated 
inmiunogenic polypeptide comprising an amino acid sequence of a streptococcal Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
5 EmaE. It is contemplated by this invention that the immunogenic polypeptide has the 
amino acid sequence set forth in any of SEQ ID NOS: 2, 4, 6, 8, 10, 23, 26, 29, 32 
and 37, including immunogenic fragments, mutants, variants, analogs, or derivatives, 
thereof. 

10 This invention is directed to analogs of the polypeptide which comprise the amino acid 
sequence as set forth above. The analog polypeptide may have an N-terminal 
methionine or a polyhistidine optionally attached to the N or COOH terminus of the 
polypeptide which comprise the amino acid sequence. 

15 In another embodiment, this invention contemplates peptide fragments of the 

polypeptide which result from proteolytic digestion products of the polypeptide. In 
another embodiment, the derivative of the polypeptide has one or more chemical 
moieties attached thereto. In another embodiment the chemical moiety is a water 
soluble polymer. In another embodiment the chemical moiety is polyethylene glycol. 

20 In another embodiment the chemical moiety is mon-, di-, tri- or tetrapegylated. In 
another embodiment the chemical moiety is N-terminal monopegylated. 

Attachment of polyethylene glycol (PEG) to compounds is particularly useftil because 
PEG has very low toxicity in mammals (Carpenter et al, 1971). For example, a PEG 

25 adduct of adenosine deaminase was approved in the United States for use in humans 
for the treatment of severe combined immunodeficiency syndrome. A second 
advantage afforded by the conjugation of PEG is that of effectively reducing the 
immunogenicty and antigenicity of heterologous compounds. For example, a PEG 
adduct of a human protein might be usefiil for the treatment of disease in other 

30 mammalian species without the risk of triggering a severe immune response. The 
compound of the present invention may be delivered in a microencapsulation device 
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SO as to reduce or prevent an host immune response against the compound or against 
cells which may produce the compound. The compound of the present invention may 
also be delivered microencapsulated in a membrane, such as a liposome. 

5 Numerous activated forms of PEG suitable for direct reaction with proteins have been 
described. Useful PEG reagents for reaction with protein amino groups include active 
esters of carboxylic acid or carbonate derivatives, particularly those in which the 
leaving groups are N-hydroxysuccinimide, p-nitrophenol, imidazole or l-hydroxy-2- 
nitrobenzene-4-sulfonate. PEG derivatives containing maleimido or haloacetyl groups 
10 are useful reagents for the modification of protein free sulfhydryl groups. Likewise, 
PEG reagents containing amino hydrazine or hydrazide groups are useful for reaction 
with aldehydes generated by periodate oxidation of carbohydrate groups in proteins. 

In one embodiment, the amino acid residues of the polypeptide described herein are 
15 preferred to be in the "L" isomeric form. In another embodiment, the residues in the 
"D" isomeric form can be substituted for any L-amino acid residue, as long as the 
desired functional property of lectin activity is retained by the polypeptide. NH2 refers 
to the jBree amino group present at the amino terminus of a polypeptide. COOH refers 
to the firee carboxy group present at the carboxy terminus of a polypeptide. 
20 Abbreviations used herein are in keeping with standard polypeptide nomenclature, J. 
Biol Chem., 243:3552-59 (1969). 

It should be noted that all amino-acid residue sequences are represented herein by 
formulae whose left and right orientation is in the conventional direction of amino- 
25 terminus to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a peptide bond to a 
further sequence of one or more amino-acid residues. 

Synthetic polypeptide, prepared using the well known techniques of solid phase, liquid 
30 phase, or peptide condensation techniques, or any combination thereof, can include 
natural and unnatural amino acids. Amino acids used for peptide synthesis may be 
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standard Boc (N^'-amino protected N"-t-butyIoxycarbonyl) amino acid resin with the 
standard deprotecting, neutralization, coupling and wash protocols of the original solid 
phase procedure of Merrifield (1963, J, Am, Chem. Soc. 85:2149-2154), or the base- 
labile N^-amino protected 9-fluorenylmethoxycarbonyl (Fmoc) amino acids first 
5 described by Carpino and Han (1972, J. Org. Chem, 37:3403-3409). Thus, 

polypeptide of the invention may comprise D-amino acids, a combination of D- and L- 
amino acids, and various "designer" amino acids (e,g.^ p-methyl amino acids, Ca- 
methyl amino acids, and Na-methyl amino acids, etc.) to convey special properties. 
Synthetic amino acids include ornithine for lysine, fluorophenylalanine for 

10 phenylalanine, and norleucine for leucine or isoleucine. Additionally, by assigning 

« 

specific amino acids at specific coupling steps, cc-helices, P turns, p sheets, y-turns, 
and cyclic peptides can be generated. 

In one aspect of the invention, the peptides may comprise a special amino acid at the 
15 C-terminus which incorporates either a COjH or CONH2 side chain to simulate a free 
glycine or a glycine-amide group. Another way to consider this special residue would 
be as a D or L amino acid analog with a side chain consisting of the linker or bond to 
the bead. In one embodiment, the pseudo-firee C-terminal residue may be of the D or 
the L optical configuration; in another embodiment, a racemic mixture of D and L- 
20 isomers may be used. 

In an additional embodiment, pyroglutamate may be included as the N-terminal residue 
of the peptide. Although pyroglutamate is not amenable to sequence by Edman 
degradation, by limiting substitution to only 50% of the peptides on a given bead with 

25 N-terminal pyroglutamate, there will remain enough non-pyroglutamate peptide on the 
bead for sequencing. One of ordinary skill would readily recognize that this technique 
could be used for sequencing of any peptide that incorporates a residue resistant to 
Edman degradation at the N-terminus. Other methods to characterize individual 
peptides that demonstrate desired activity are described in detail infra. Specific 

30 activity of a peptide that comprises a blocked N-terminal group, e.^., pyroglutamate, 
when the particular N-terminal group is present in 50% of the peptides, would readily 



wo 02/12294 PCT/USOl/24795 

32 

be demonstrated by comparing activity of a completely (100%) blocked peptide with a 
non-blocked (0%) peptide. 



In addition, the present invention envisions preparing peptides that have more well 
5 defined structural properties, and the use of peptidomimetics, and peptidomimetic 
bonds, such as ester bonds, to prepare peptides with novel properties. In another 
embodiment, a peptide may be generated that incorporates a reduced peptide bond, 
i.e., R1-CH2-NH-R2, where Ri and R2 are amino acid residues or sequences. A 
reduced peptide bond may be introduced as a dipeptide subunit. Such a molecule 

10 would be resistant to peptide bond hydrolysis, e.g., protease activity. Such peptides 
would provide ligands with unique function and activity, such as extended half-lives in 
vivo due to resistance to metabolic breakdown, or protease activity. Furthermore, it is 
well known that in certain systems constrained peptides show enhanced functional 
activity (Hruby, 1982, j'c/e/^ce^ 3 1:189-- 199; Hruby et al., 1990, Biochem J, 

15 268:249-262); the present invention provides a method to produce a constrained 
peptide that incorporates random sequences at all other positions. 



A constrained, cyclic or rigidized peptide may be prepared synthetically, provided that 
in at least two positions in the sequence of the peptide an amino acid or amino acid 

20 analog is inserted that provides a chemical functional group capable of cross-linking to 
constrain, cyclise or rigidize the peptide after treatment to form the cross-link. 
Cyclization will be favored when a turn-inducing amino acid is incorporated. 
Examples of amino acids capable of cross-linking a peptide are cysteine to form 
disulfide, aspartic acid to form a lactone or a lactase, and a chelator such as 

25 Y-carboxyl-glutamic acid (Gla) (Bachem) to chelate a transition metal and form a 
cross-link. Protected ycarboxyl glutamic acid may be prepared by modifying the 
synthesis described by Zee-Cheng and Olson (1980, Biophys. Biochem, Res, Commun. 
94: 1 1 28- 1 1 32). A peptide in which the peptide sequence comprises at least two 
amino acids capable of cross-linking may be treated, e.g,, by oxidation of cysteine 

30 residues to form a disulfide or addition of a metal ion to form a chelate, so as to cross- 
link the peptide and form a constrained, cyclic or rigidized peptide. 
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The present invention provides strategies to systematically prepare cross-links. For 

example, if four cysteine residues are incorporated in the peptide sequence, different 

protecting groups may be used (Hiskey, 1981, in The Peptides: Analysis, Synthesis, 

5 Biology, Vol. 3, Gross and Meienhofer, eds.. Academic Press: New York, pp. 137- 

167; Ponsanti et al., 1990, Tetrahedron 46:8255-8266). The first pair of cysteine may 

be deprotected and oxidized, then the second set may be deprotected and oxidized. In 

this way a defined set of disulfide cross-links may be formed. Alternatively, a pair of 

cysteine and a pair of collating amino acid analogs may be incorporated so that the 

10 cross-links are of a different chemical nature. 

« 

The following non-classical amino acids may be incorporated in the peptide in order to 
introduce particular conformational motifs: 1 ,2,3 ,4-tetrahydroisoquinoline-3 - 
carboxylate (Kazmierski et al, 1991, J. Am. Chem. Soc. 113:2275-2283); (2S,3S)- 

15 methyl-phenylalanine, (2S,3R)-methyl-phenylalanine, (2R,3S)-methyl-phenylalanine 
and (2R,3R)-methyl-phenylalanine (Kazmierski and Hruby, 1991, Tetrahedron Lett.); 
2-aminotetrahydronaphthalene-2-carboxylic acid (Landis, 1989, Ph.D. Thesis, 
University of Arizona); hydroxy-l,2,3,4-tetrahydroisoquinoline-3-carboxylate (Miyake 
et al, 1989, J. Takeda Res. Labs. 43:53-76); P-carboline (D and L) (Kazmierski, 

20 1988, Ph.D. Thesis, University of Arizona); HIC (histidine isoquinoline carboxylic 

acid) (Zechel et al., 1991, Int. J. Pep. Protein Res. 43); and HIC (histidine cyclic urea) 
(Dharanipragada). 

The following amino acid analogs and peptidomimetics may be incorporated into a 
25 peptide to induce or favor specific secondary structures: LL-Acp (LL-3-amino- 
2-propenidone-6-carboxylic acid), a P-tum inducing dipeptide analog (Kemp et al, 
1985, J. Org. Chem. 50:5834-5838); P-sheet inducing analogs (Kemp et al, 1988, 
Tetrahedron Lett, 29:5081-5082); P-turn inducing analogs (Kemp et al, 1988, 
Tetrahedron Lett. 29:5057-5060); «-helix inducing analogs (Kemp et al, 1988, 
30 Tetrahedron Lett 29:4935-4938); y-turn inducing analogs (Kemp et al., 1989, J. Org. 
Chem. 54: 109: 1 15); and analogs provided by the following references: Nagai and 
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Sato, 1985, Tetrahedron Lett, 26:647-650; DiMaio et al., 1989, J, Chem, Soc. Perkin 
Trans, p. 1687; also a Gly-Ala turn analog (Kahn et al., 1989, Tetrahedron Lett. 
30:2317); amide bond isostere (Jones et al,, 1988, Tetrahedron Lett 29:3853-3856); 
tretrazol (Zabrocki et al., 1988, J, Am. Chem. Soc. 110:5875-5880); DTC (Samanen 
5 et al, 1990, Int. J, Protein Pep. Res. 35:501 :509); and analogs taught in Olson et al., 
1990, J. Am, Chem. Set 112:323-333 and Garvey et al., 1990, J. Org, Chem, 56:436. 
Conformationally restricted mimetics of beta turns and beta bulges, and peptides 
containing them, are described in U.S. Patent No. 5,440,013, issued August 8, 1995 to 
Kahn. 

10 

The present invention further provides for modification or derivatization of the 
polypeptide or peptide of the invention. Modifications of peptides are well known to 
one of ordinary skill, and include phosphorylation, carboxymethylation, and acylation. 
Modifications may be effected by chemical or enzymatic means. In another aspect, 

15 glycosylated or fatty acylated peptide derivatives may be prepared. Preparation of 
glycosylated or fatty acylated peptides is well known in the art. Fatty acyl peptide 
derivatives may also be prepared. For example, and not by way of limitation, a free 
amino group (N-terminal or lysyl) may be acylated, e.g,^ myristoylated. In another 
embodiment an amino acid comprising an aliphatic side chain of the structure - 

20 (CH2)nCH3 may be incorporated in the peptide. This and other peptide-fatty acid 
conjugates suitable for use in the present invention are disclosed in U.K. Patent GB- 
8 809 1 62.4, International Patent Application PCT/AU89/00 1 66, and reference 5, 
supra. 



25 Chemical Moieties For Derivatization. Chemical moieties suitable for derivatization 
may be selected from among water soluble polymers. The polymer selected should be 
water soluble so that the component to which it is attached does not precipitate in an 
aqueous environment, such as a physiological environment. Preferably, for therapeutic 
use of the end-product preparation, the polymer will be pharmaceutically acceptable. 

30 One skilled in the art will be able to select the desired polymer based on such 
considerations as whether the polymer/component conjugate will be used 
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therapeutically, and if so, the desired dosage, circulation time, resistance to 
proteolysis, and other considerations. For the present component or components, 
these may be ascertained using the assays provided herein. 

5 The water soluble polymer may be selected from the group consisting of, for example, 
polyethylene glycol, copolymers of ethylene glycol/propylene glycol, 
carbo^tymethylcellulose, dextran, polyvinyl alcohol, poljrvinyl pyrrolidone, poly-1, 
3-dioxolane, poly-l,3,6-trioxane, ethylene/maleic anhydride copolymer, 
polyaminoacids (either homopolymers or random copolymers), and dextran or 
10 poly(n-vinyl pyrrolidone)polyethylene glycol, propropylene glycol homopolymers, 
prolypropylene oxide/ethylene oxide co- polymers, polyoxyethylated polyols and 
polyvinyl alcohol Polyethylene glycol propionaldenhyde may have advantages in 
manufacturing due to its stability in water. 

15 The polymer may be of any molecular weight, and may be branched or unbranched. 
For polyethylene glycol, the preferred molecular weight is between about 2kDa and 
about lOOkDa (the term "about" indicating that in preparations of polyethylene glycol, 
some molecules will weigh more, some less, than the stated molecular weight) for ease 
in handling and manufacturing. Other sizes may be used, depending on the desired 

20 therapeutic profile (e.g., the duratibn of sustained release desired, the effects, if any 
on biological activity, the ease in handling, the degree or lack of antigenicity and other 
known effects of the polyethylene glycol to a therapeutic protein or analog). 

The number of polymer molecules so attached may vary, and one skilled in the art will 
25 be able to ascertain the effect on function. One may mono-derivative, or may provide 
for a di-, tri-, tetra- or some combination of derivatization, with the same or different 
chemical moieties (e.g:, polymers, such as different weights of polyethylene glycols). 
The proportion of polymer molecules to component or components molecules will 
vary, as will their concentrations in the reaction mixture. In general, the optimum ratio 
30 (in terms of efficiency of reaction in that there is no excess unreacted component or 
components and polymer) will be determined by factors such as the desired degree of 
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derivatization (e.g., mono, di-, tri-, etc), the molecular weight of the polymer selected, 
whether the polymer is branched or unbranched, and the reaction conditions. 

The polyethylene glycol molecules (or other chemical moieties) should be attached to 
5 the component or components with consideration of effects on functional or antigenic 
domains of the protein. There are a number of attachment methods available to those 
skilled in the art, e.g,, EP 0 401 384 herein incorporated by reference (coupling PEG 
to G-CSF), see also Malik etaL, 1992, Exp. Hematol 20:1028-1035 (reporting 
pegylation of GM-CSF using tresyl chloride). For example, polyethylene glycol may 

10 be covalently bound through amino acid residues via a reactive group, such as, a free 
amino or carboxyl group. Reactive groups are those to which an activated 
polyethylene glycol molecule may be bound. The amino acid residues having a free 
amino group include lysine residues and the - terminal amino acid residues; those 
having a free carboxyl group include aspartic acid residues glutamic acid residues and 

15 the C-terminal amino acid residue. Sulfhydrl groups may also be used as a reactive 
group for attaching the polyethylene glycol molecule(s). Preferred for therapeutic 
purposes is attachment at an amino group, such as attachment at the N-terminus or 
lysine group. 

20 Nucleic Acids 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of 
the art. Such techniques are explained fully in the literature. See, e.g., Sambrook et 

25 al, "Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in 

Molecular Biology" Volumes MIX [Ausubel, R. M., ed. (1994)]; "Cell Biology: A 
Laboratory Handbook" Volumes UII [J. E. Cells, ed. (1994))]; "Current Protocols in 
Immunology" Volumes I-III [Coligan, J. E., ed, (1994)]; "Oligonucleotide Synthesis" 
(M.J. Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. 

30 (1985)]; "Transcription And Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; 
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"Animal Cell Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" 
[IRL Press, (1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" (1984). 

Mutations can be made in a nucleic acid encoding the polypeptide of the present 
5 invention such that a particular codon is changed to a codon which codes for a 
different amino acid. Such a mutation is generally made by making the fewest 
nucleotide changes possible. A substitution mutation of this sort can be made to 
change an amino acid in the resulting protein in a non-conservative manner (i.e., by 
changing the codon from an amino acid belonging to a grouping of amino acids having 
10 a particular size or characteristic to an amino acid belonging to another grouping) or in 
a conservative manner (i.e., by changing the codon from an amino acid belonging to a 
grouping of amino acids having a particular size or characteristic to an amino acid 
belonging to the same grouping). Such a conservative change generally leads to less 
change in the structure and function of the resulting protein. A non-conservative 
15 change is more likely to alter the structure, activity or function of the resulting protein. 
The present invention should be considered to include sequences containing 
conservative changes which do not significantly alter the activity or binding 
characteristics of the resulting protein. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
20 belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, 

leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. Amino 
acids containing aromatic ring structures are phenylalanine, tryptophan, and tyrosine. 
The polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, 
asparagine, and glutamine. The positively charged (basic) amino acids include 
25 arginine, lysine and histidine. The negatively charged (acidic) amino acids include 
aspartic acid and glutamic acid. Such alterations will not be expected to affect 
apparent molecular weight as determined by polyacrylamide gel electrophoresis, or 
isoelectric point. 

30 Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 



wo 02/12294 



PCT/USOl/24795 



38 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free ISIH2 can be maintained. 

5 Synthetic DNA sequences allow convenient construction of genes which will express 
analogs or "muteins". A general method for site-specific incorporation of unnatural 
amino acids into proteins is described inNoren, et al. Science, 244:182-188 (April 
1989). This method may be used to create analogs with unnatural amino acids. 

10 This invention provides an isolated nucleic acid encoding a polypeptide comprising an 
amino acid sequence of a streptococcal Ema polypeptide. This invention provides an 
isolated nucleic acid encoding a polypeptide comprising an amino acid sequence of a 
streptococcal Ema polypeptide. This invention provides an isolated nucleic acid 
encoding a polypeptide comprising an amino acid sequence of a Group B 

15 streptococcal Ema polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE. This invention provides an isolated nucleic acid encoding a 
polypeptide comprising an amino acid sequence of a Group B streptococcal Ema 
protein selected from the group of Ema proteins EmA, EmaB, EmaC, EmaD and 
EmaE as set forth in FIGURES 2-6. The invention provides an isolated nucleic acid 

20 encoding a polypeptide comprising an amino acid sequence of a bacterial Ema 
polypeptide selected from the group of SEQ ID NO: 23, 26, 29, 32 and 37. In 
particular embodiments the nucleic acid is set forth in any of SEQ ID NOS: 1, 3, 5, 7, 
9, 24, 27, 30, and 33, including fragments, mutants, variants, analogs, or derivatives, 
thereof The nucleic acid is DNA, cDNA, genomic DNA, RNA. Further, the isolated 

25 nucleic acid may be operatively linked to a promoter of RNA transcription. 

The present invention also relates to isolated nucleic acids, such as recombinant DNA 
molecules or cloned genes, or degenerate variants thereof, mutants, analogs, or 
fragments thereof, which encode the isolated polypeptide or which competitively 
30 inhibit the activity of the polypeptide. The present invention fiirther relates to isolated 
nucleic acids, such as recombinant DNA molecules or cloned genes, or degenerate 



wo 02/12294 



PCT/USOl/24795 



39 

variants thereof, mutants, analogs, or fragrhents thereof, which encode a GBS Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE. Preferably, the isolated nucleic acid, which includes degenerates, variants, 
mutants, analogs, or fragments thereof, has a sequence as set forth in SEQ ID NOS: 1, 
5 3, 5, 7 or 9. In a further embodiment of the invention, the DNA sequence of the 

recombinant DNA molecule or cloned gene may be operatively linked to an expression 
control sequence which may be introduced into an appropriate host. The invention 
accordingly extends to unicellular hosts transformed with the cloned gene or 
recombinant DNA molecule comprising a DNA sequence encoding an Ema protein, 
10 particularly selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE^ and 

4 

more particularly, the DNA sequences or fragments thereof determined from the 
sequences set forth above. 

In a particular embodiment, the nucleic acid encoding the EmaA polypeptide has the 
15 sequence selected from the group comprising SEQ ID NO: 1; a sequence that 

hybridizes to SEQ ID NO:l mder moderatestringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO :l or 
a sequence that hybridizes to SEQ ID NO: 1 under moderatestringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
20 thereof. In a particular embodiment, the nucleic acid encoding the EmaA polypeptide 
has the sequence selected from the group comprising SEQ ID N0:1; a sequence 
complementary to SEQ ID NO:l; or a homologous sequence which is substantially 
similar to SEQ ID NO: 1. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO: 1 . 

25 

In a particular embodiment, the nucleic acid encoding the EmaB polypeptide has the 
sequence selected from the group comprising SEQ ID NO:3; a sequence that 
hybridizes to SEQ ID N0:3 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 3 or 
30 a sequence that hybridizes to SEQ ID NO:3 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaB polypeptide 
has the sequence selected from the group comprising SEQ ID NO:3; a sequence 
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complementary to SEQ ID NO:3; or a homologous sequence which is substantially 
similar to SEQ ID NO:3. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO;3. 

5 In a particular embodiment, the nucleic acid encoding the EmaC polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 5; a sequence that 
hybridizes to SEQ ID NO: 5 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 5 or 
a sequence that hybridizes to SEQ ID NO: 5 under moderate stringency hybridization 

10 conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaC polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 5; a sequence 
complementary to SEQ ID NO: 5; or a homologous sequence which is substantially 
similar to SEQ ID NO: 5, In a further embodiment, the nucleic acid has the sequence 

15 consisting of SEQ ID NO:5. 

In a particular embodiment, the nucleic acid encoding the EmaD polypeptide has the 
sequence selected from the group comprising SEQ ID NO: 7; a sequence that 
hybridizes to SEQ ID NO: 7 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ ID NO: 7 or 
a sequence that hybridizes to SEQ ID NO: 7 under moderate stringency hybridization 
conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof In a particular embodiment, the nucleic acid encoding the EmaD polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 7; a sequence 
complementary to SEQ ID NO:7; or a homologous sequence which is substantially 
similar to SEQ ID NO:7. In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID NO:7. 

In a particular embodiment, the nucleic acid encoding the EmaE polypeptide has the 
30 sequence selected from the group comprising SEQ ID NO: 9; a sequence that 

hybridizes to SEQ ID NO: 9 under moderate stringency hybridization conditions; DNA 
sequences capable of encoding the amino acid sequence encoded by SEQ, ID NO: 9 or 
a sequence that hybridizes to SEQ ID NO: 9 under moderate stringency hybridization 



20 



25 
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conditions; degenerate variants thereof; alleles thereof; and hybridizable fragments 
thereof. In a particular embodiment, the nucleic acid encoding the EmaE polypeptide 
has the sequence selected from the group comprising SEQ ID NO: 9; a sequence 
complementary to SEQ ID NO:9; or a homologous sequence which is substantially 
5 similar to SEQ ID NO: 9 In a further embodiment, the nucleic acid has the sequence 
consisting of SEQ ID N0:9. 

A nucleic acid capable of encoding a GBS polypeptide EmaA, EmaB, EmaC, EmaD or 
EmaE which is a recombinant DNA molecule is further provided. Such a recombinant 
10 DNA molecule wherein the DNA molecule is operatively linked to an expression 
control sequence is also provided herein. 

The present invention relates to nucleic acid vaccines or DNA vaccines comprising 
nucleic acids encoding immunogenic bacterial Ema polypeptides, particularly 

15 immunogenic streptococcal Ema polypeptides. The present invention relates to nucleic 
acid vaccines or DNA vaccines comprising nucleic acids encoding immunogenic GBS 
Ema polypeptides, particularly selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE. The present invention relates to nucleic acid vaccines or DNA 
vaccines comprising nucleic acids encoding one or more immimogenic GBS Ema 

20 polypeptide or a fragment thereof or any combination of one or more Ema polypeptide 
EmaA, EmaB, EmaC, EmaD or EmaE with at least one other GBS polypeptide, 
particularly wherein said other GBS polypeptide is selected from the group of Spbl, 
Spb2, C protein alpha antigen, Rib and immunogenic polypeptide fragments thereof. 

25 The invention further relates to a vaccine for protection of an animal subject from 
infection with a streptococcal bacterium comprising a vector containing a gene 
encoding an Ema polypeptide, particularly selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, operatively associated with a promoter capable of directing 
expression of the gene in the subject. The invention further relates to a vaccine for 

30 protection of an animal subject from infection with a Group B streptococcal bacterium 
comprising a vector containing a gene encoding an Ema polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE operatively associated with a 
promoter capable of directing expression of the gene in the subject. The present 
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invention further provides a nucleic acid vaccine comprising a recombinant DNA 
molecule capable of encoding a GBS polypeptide EmaA, EmaB, EmaC, EmaD oi 
EmaE. 



5 The present invention provides a vector which comprises the nucleic acid capable of 
encoding encoding a bacterial Ema polypeptide, particularly a streptococcal Ema 
polypeptide. The present invention provides a vector which comprises the nucleic acid 
capable of encoding encoding an Ema polypeptide selected from the group of EmaA, 
EmaB, EmaC, EmaD and EmaE and a promoter. The present invention provides a 
10 vector which comprises the nucleic acid of any of SEQ ID NO: 1, 3, 5, 7, 9, 24, 27, 
30, and 33, and a promoter. The invention contemplates a vector wherein the 
promoter comprises a bacterial, yeast, insect or mammalian promoter. The invention 
contemplates a vector wherein the vector is a plasmid, cosmid, yeast artificial 
chromosome (YAC), bacteriophage or eukaryotic viral DNA. 



15 



The present invention further provides a host vector system for the production of a 
polypeptide which comprises the vector capable of encoding encoding an Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, in a suitable host cell. A host vector system is provided wherein the suitable 
20 host cell comprises a prokaryotic or eukaryotic cell. A unicellular host transformed 
with a recombinant DNA molecule or vector capable of encoding encoding an Ema 
polypeptide, particularly selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, is thereby provided. 



25 A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA 
segment may be attached so as to bring about the replication of the attached segment. 

A "DNA" or "DNA molecule" refers to the polymeric form of deoxyribonucleotides 
(adenine, guanine, thymine, or cytosine) in its either single stranded form, or a double- 
30 stranded helix. This term refers only to the primary and secondary structure of the 
molecule, and does not limit it to any particular tertiary forms. Thus, this term 
@ includes double-stranded DNA found, inter alia, in linear DNA molecules, (e.g., 

restriction fragments), viruses, plasmids, and chromosomes. In discussing the 
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structure of particular double-stranded DNA molecules, sequences may be described 
herein according to the normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA (i.e., the strand having a sequence 
homologous to the mRNA). 

5 

An "origin of replication" refers to those DNA sequences that participate in DNA 
synthesis. 

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed 
10 and translated into a polypeptide in vivo when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding sequence are determined by a 
start codon at the 5* (amino) terminus and a translation stop codon at the 3* (carboxyl) 
terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, 
cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g., 
15 mammalian) DNA, and even synthetic DNA sequences. A polyadenylation signal and 
transcription termination sequence will usually be located 3' to the coding sequence in 
the case of eukaryotic mRNA. 

Transcriptional and translational control sequences are DNA regulatory sequences, 
20 such as promoters, enhancers, polyadenylation signals, terminators, and the like, that 
provide for the expression of a coding sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3' direction) coding 

25 sequence. For purposes of defining the present invention, the promoter sequence is 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' 
direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with 

30 nuclease SI), as well as protein binding domains (consensus sequences) responsible for 
the binding of RNA polymerase. Eukaryotic promoters will often, but not always, 
contain "TATA" boxes and "CAT" boxes. Prokaryotic promoters contaiij Shine- 
Dalgarno sequences in addition to the -10 and -35 consensus sequences. 
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An "expression control sequence" is a DNA sequence that controls and regulates the 
transcription and translation of another DNA sequence. A coding sequence is "under 
the control" of transcriptional and translational control sequences in a cell when RNA 
polymerase transcribes the coding sequence into naKNA, which is then translated into 
5 the protein encoded by the coding sequence. 

A "signal sequence" can be included before the coding sequence. This sequence 
encodes a signal peptide, N-terminal to the polypeptide, that communicates to the host 
cell to direct the polypeptide to the cell surface or secrete the polypeptide into the 
10 media, and this signal peptide is clipped off by the host cell before the protein leaves 
the cell. Signal sequences can be found associated with a variety of proteins native to 
prokaryotes and eukaryotes. 

The term "oligonucleotide," as used herein in referring to the probe of the present 
15 invention, is defined as a molecule comprised of two or more ribonucleotides, 

preferably more than three. Its exact size will depend upon many factors which, in 
turn, depend upon the ultimate function and use of the oligonucleotide. 

The term "primer" as used herein refers to an oligonucleotide, whether occurring 
20 naturally as in a purified restriction digest or produced synthetically, which is capable 
of acting as a point of initiation of synthesis when placed under conditions in which 
synthesis of a primer extension product, which is complementary to a nucleic acid 
strand, is induced, i.e., in the presence of nucleotides and an inducing agent such as a 
DNA polymerase and at a suitable temperature and pH. The primer may be either 
25 single-stranded or double-stranded and must be sufficiently long to prime the synthesis 
of the desired extension product in the presence of the inducing agent. The exact 
length of the primer will depend upon many factors, including temperature, source of 
primer and use of the method. For example, for diagnostic applications, depending on 
the complexity of the target sequence, the oligonucleotide primer typically contains 
30 1 5-25 or more nucleotides, although it may contain fewer nucleotides. 

The primers herein are selected to be "substantially" complementary to different 
strands of a particular target DNA sequence. This means that the primers must be 
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sufficiently complementary to hybridize with their respective strands. Therefore, the 
primer sequence need not reflect the exact sequence of the template. For example, a 
non-complementary nucleotide fragment may be attached to the 5' end of the primer, 
with the remainder of the primer sequence being complementary to the strand. 
5 Alternatively, non-complementary bases or longer sequences can be interspersed into 
the primer, provided that the primer sequence has sufficient complementarity with the 
sequence of the strand to hybridize therewith and thereby form the template for the 
synthesis of the extension product. 

10 As used herein, the terms "restriction endonucleases" and "restriction enzymes" refer 
to bacterial enzymes, each of which cut double-stranded DNA at or near a specific 
nucleotide sequence. 

A cell has been "transformed" by exogenous or heterologous DNA when such DNA 
15 has been introduced inside the cell. The transforming DNA may or may not be 

integrated (covalently linked) into chromosomal DNA making up the genome of the 
cell. In prokaryotes, yeast, and mammalian cells for example, the transforming DNA 
may be maintained on an episomal element such as a plasmid. With respect to 
eukaryotic cells, a stably transformed cell is one in which the transforming DNA has 
20 become integrated into a chromosome so that it is inherited by daughter cells through 
chromosome replication. This stability is demonstrated by the ability of the eukaryotic 
cell to establish cell lines or clones comprised of a population of daughter cells 
containing the transforming DNA. A "clone" is a population of cells derived from a 
single cell or common ancestor by mitosis. A "cell line" is a clone of a primary cell 
25 that is capable of stable growth in vitro for many generations. 

Two DNA sequences are "substantially homologous" when at least about 75% 
(preferably at least about 80%, and most preferably at least about 90 or 95%) of the 
nucleotides match over the defined length of the DNA sequences. Sequences that are 
30 substantially homologous can be identified by comparing the sequences using standard 
software available in sequence data banks, or in a Southern hybridization experiment 
under, for example, stringent conditions as defined for that particular system. Defining 
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appropriate hybridization conditions is witfdn the skill of the art. See, e.g., Maniatis et 
al., supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid Hybridization, supra, 

A DNA sequence is "operatively linked" to an expression control sequence when the 
5 expression control sequence controls and regulates the transcription and translation of 
that DNA sequence. The term "operatively linked" includes having an appropriate 
start signal (e.g., ATG) in front of the DNA sequence to be expressed and maintaining 
the correct reading frame to permit expression of the DNA sequence under the control 
of the expression control sequence and production of the desired product encoded by 
10 the DNA sequence. If a gene that one desires to insert into a recombinant DNA 
molecule does not contain an appropriate start signal, such a start signal can be 
inserted in front of the gene. 

The term "standard hybridization conditions" refers to salt and temperature conditions 
15 substantially equivalent to 5 x SSC and 65 °C for both hybridization and wash. 
However, one skilled in the art will appreciate that such "standard hybridization 
conditions" are dependent on particular conditions including the concentration of 
sodium and magnesium in the buffer, nucleotide sequence length and concentration, 
percent mismatch, percent formamide, and the like. Also important in the 
20 determination of "standard hybridization conditions" is whether the two sequences 
hybridizing are RNA-RNA, DNA-DNA or RNA-DNA. Such standard hybridization 
conditions are easily determined by one skilled in the art according to well known 
formulae, wherein hybridization is typically 10-20°C below the predicted or 
determined T^^ with washes of higher stringency, if desired. 

25 

It should be appreciated that also within the scope of the present invention are DNA 
sequences encoding an Ema polypeptide EmaA, EmaB, EmaC, EmaD or EmaE which 
code for an Ema polypeptide having the same amino acid sequence as any of SEQ ID 
N0S:2, 4, 6, 8 or 10, but which are degenerate to any of SEQ ID NOS: 1, 3, 5, 7 or 9. 
30 By "degenerate to" is meant that a different three-letter codon is used to specify a 
particular amino acid. It is well known in the art that the following codons can be 
used interchangeably to code for each specific amino acid: 
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Phenylalanine (Phe or F) UUU or UUC 

Leucine (Leu or L) UUA or UUG or CUU or CUC or CUA or CUG 

Isoleucine (lie or I) AUU or AUG or AUA 

Methionine (Met or M) AUG 

5 Valine (Val or V) GUU or GUC of GUA or GUG 

Serine (Ser or S) UCU or UCC or UCA or UCG or AGU or AGC 

Proline (Pro or P) CCU or CCC or CCA or CCG 

Threonine (Thr or T) ACU or ACC or ACA or ACG 

Alanine (Ala or A) GCU or GCG or GCA or GCG 

10 Tyrosine (Tyr or Y) UAU or UAC 

Histidine(HisorH) CAUorCAC 

Glutamine (Gin or Q) CAA or GAG 

Asparagine (Asn or N) AAU or AAC 

Lysine (Lys or K) AAA or AAG 

15 Aspartic Acid (Asp or D) GAU or GAG 

Glutamic Acid (Glu or E) GAA or GAG 

Cysteine (Cys or C) UGU or UGC 

Arginine (Arg or R) CGU or GGG or GGA or GGG or AGA or AGG 

Glycine (Gly or G) GGU or GGG or GGA or GGG 

20 Tryptophan (Trp or W) UGG 

Termination codoh UAA (ochre) or UAG (amber) or UGA (opal) 

It should be understood that the codons specified above are for RNA sequences. The 
corresponding codons for DNA have a T substituted for U. 

25 

Mutations can be made in SEQ ID NOS: 1, 3, 5, 7 or 9 such that a particular codon is 
changed to a codon which codes for a different amino acid. Such a mutation is 
generally made by making the fewest nucleotide changes possible. A substitution 
mutation of this sort can be made to change an amino acid in the resulting protein in a 
30 non-conservative manner (i.e., by changing the codon from an amino acid belonging to 
a grouping of amino acids having a particular size or characteristic to an amino acid 
belonging to another grouping) or in a conservative manner (i.e., by changing the 
codon from an amino acid belonging to a grouping of amino acids having a particular 



wo 02/12294 



PCT/USOl/24795 



48 

size or characteristic to an amino acid belonging to the same grouping). Such a 
conservative change generally leads to less change in the structure and function of the 
resulting protein. A non-conservative change is more likely to alter the structure, 
activity or function of the resulting protein. The present invention should be 
5 considered to include sequences containing conservative changes which do not 
significantly alter the activity or binding characteristics of the resulting protein. 

Two amino acid sequences are "substantially homologous" when at least about 70% of 
the amino acid residues (preferably at least about 80%, and most preferably at least 
10 about 90 or 95%) are identical, or represent conservative substitutions. 

A "heterologous" region of the DNA construct is an identifiable segment of DNA 
within a larger DNA molecule that is not found in association with the larger molecule 
in nature. Thus, when the heterologous region encodes a mammalian gene, the gene 

15 will usually be flanked by DNA that does not flank the mammalian genomic DNA in 
the genome of the source organism. Another example of a heterologous coding 
sequence is a construct where the coding sequence itself is not fotmd in nature (e.g., a 
cDNA where the genomic coding sequence contains introns, or synthetic sequences 
having codons different than the native gene). Allelic variations or naturally-occurring 

20 mutational events do not give rise to a heterologous region of DNA as defined herein. 

A DNA sequence is "operatively linked" to an expression control sequence when the 
expression control sequence controls and regulates the transcription and translation of 
that DNA sequence. The term "operatively linked" includes having an appropriate 

25 start signal (e.g., ATG) in front of the DNA sequence to be expressed and maintaining 
the correct reading frame to permit expression of the DNA sequence under the control 
of the expression control sequence and production of the desired product encoded by 
the DNA sequence. If a gene that one desires to insert into a recombinant DNA 
molecule does not contain an appropriate start signal, such a start signal can be 

30 inserted in front of the gene. 



Further this invention also provides a vector which comprises the above-dpscribed 
nucleic acid molecule. The promoter may be, or is identical to, a bacterial, yeast. 
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insect or mammalian promoter. Further, tKe vector may be a plasmid, cosmid, yeast 
artificial chromosome (YAC), bacteriophage or eukaryotic viral DNA. Other 
numerous vector backbones known in the art as usefijl for expressing protein may be 
employed. Such vectors include, but are not limited to: adenovirus, simian virus 40 
5 (SV40), cytomegalovirus (CMV), mouse mammary tumor virus (MMTV), Moloney 
murine leukemia virus, DNA delivery systems, i.e. liposomes, and expression plasmid 
delivery systems. Such vectors may be obtained commercially or assembled from the 
sequences described by methods well-known in the art. 

10 This invention also provides a host vector system for the production of a polypeptide 
which comprises the vector of a suitable host cell. A wide variety of unicellular host 
cells are also useful in expressing the DNA sequences of this invention. These hosts 
may include well known eukaryotic and prokaryotic hosts, such as strains of E. coli^ 
Pseudomonas, Bacillus, Streptomyces, fungi such as yeasts, and animal cells, such as 

15 CHO, Rl.l, B-W and L-M cells, African Green Monkey kidney cells (e.g., COS 1, 
COS 7, BSCl, BSC40, and EMTIO), insect cells (e.g., Sf9), and human cells and 
plant cells in tissue culture. 

A wide variety of host/expression vector combinations may be employed in expressing 
20 the DNA sequences of this invention. Useful expression vectors, for example, may 
consist of segments of chromosomal, non-chromosomal and synthetic DNA 
sequences. Suitable vectors include derivatives of S V40 and known bacterial 
plasmids, e.g., coli plasmids col El, pCRl, pBR322, pMB9 and their derivatives, 
plasmids such as RP4; phage DNAs, e.g., the numerous derivatives of phage A, M13 
25 and filamentous single stranded phage DNA; yeast plasmids such as the 2\x plasmid or 
derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in insect 
or mammalian cells; vectors derived from combinations of plasmids and phage DNAs, 
such as plasmids that have been modified to employ phage DNA or other expression 
control sequences; and the like. 

30 

Any of a wide variety of expression control sequences ~ sequences that control the 
expression of a DNA sequence operatively linked to it ~ may be used in these vectors 
to express the DNA sequences of this invention. Such useful expression control 
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sequences include, for example, the early or late promoters of SV40, CMV, vaccinia, 
polyoma or adenovirus, the lac system, the trp system, the TAC system, the TRC 
system, the LTR system, the major operator and promoter regions of phage X, the 
control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or 
5 other glycolytic enzymes, the promoters of acid phosphatase (e.g., Pho5), the 

promoters of the yeast a-mating factors, and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic cells or their viruses, and various 
combinations thereof. 

10 It will be understood that not all vectors, expression control sequences and hosts will 
function equally well to express the DNA sequences of this invention. Neither will all 
hosts function equally well with the same expression system. However, one skilled in 
the art will be able to select the proper vectors, expression control sequences, and 
hosts without undue experimentation to accomplish the desired expression without 

15 departing from the scope of this invention. For example, in selecting a vector, the host 
must be considered because the vector must function in it. The vector's copy number, 
the ability to control that copy number, and the expression of any other proteins 
encoded by the vector, such as antibiotic markers, will also be considered. 

20 In selecting an expression control sequence, a variety of factors will normally be 
considered. These include, for example, the relative strength of the system, its 
controllability, and its compatibility with the particular DNA sequence or gene to be 
expressed, particularly as regards potential secondary structures. Suitable unicellular 
hosts will be selected by consideration of, e.g., their compatibility with the chosen 

25 vector, their secretion characteristics, their ability to fold proteins correctly, and their 
fermentation requirements, as well as the toxicity to the host of the product encoded 
by the DNA sequences to be expressed, and the ease of purification of the expression 
products. 

30 This invention further provides a method of producing a polypeptide which comprises 
growing the above-described host vector system under suitable conditions permitting 
the production of the polypeptide and recovering the polypeptide so produced. 
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As used herein, "pg" means picogram, "ng" means nanogram, "ug" or "|ig" mean 
microgram, "mg" means milligram, "ul" or "^1" mean microliter, "ml" means milliliter, 
"1" means liter. 

5 The present invention extends to the preparation of antisense oligonucleotides and 
ribozymes that may be used to interfere with the expression of one or more Ema 
protein at the translational level. This approach utilizes antisense nucleic acid and 
ribozymes to block translation of a specific mRNA, either by masking that mRNA with 
an antisense nucleic acid or cleaving it with a ribozyme. 

10 

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least 
a portion of a specific mRNA molecule. (See Weintraub, 1990; Marcus-Sekura, 1988.) 
In the cell, they hybridize to that mRNA, forming a double stranded molecule. The 
cell does not translate an mRNA in this double-stranded form. Therefore, antisense 

15 nucleic acids interfere with the expression of mRNA into protein. Oligomers of about 
fifteen nucleotides and molecules that hybridize to the AUG initiation codon will be 
particularly efficient, since they are easy to synthesize and are likely to pose fewer 
problems than larger molecules when introducing them into Ema-producing cells. 
Antisense methods have been used to inhibit the expression of many genes in vitro 

20 (Marcus-Sekura, 1988; Hambor et al., 1988). 

Ribozymes are RNA molecules possessing the ability to specifically cleave other single 
stranded RNA molecules in a manner somewhat analogous to DNA restriction 
endonucleases. Ribozymes were discovered from the observation that certain mRNAs 
25 have the ability to excise their OAvn introns. By modifying the nucleotide sequence of 
these RNAs, researchers have been able to engineer molecules that recognize specific 
nucleotide sequences in an RNA molecule and cleave it (Cech, 1988,). Because they 
are sequence-specific, only mRNAs with particular sequences are inactivated. 

30 Investigators have identified two types of ribozymes, Tetrahymena-type and 

"hammerhead"-type. (Hasselhoflfand Gerlach, 1988) Tetrahymena-type ribozymes 
recognize four-base sequences, while "hammerhead"-type recognize eleven- to 
eighteen-base sequences. The longer the recognition sequence, the more likely it is to 
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occur exclusively in the target mRNA species. Therefore, hammerhead-type 
ribozymes are preferable to Tetrahymejia-typo ribozymes for inactivating a specific 
mRNA species, and eighteen base recognition sequences are preferable to shorter 
recognition sequences. 

5 

Antibodies 

This invention further provides an antibody capable of specifically recognizing or 
binding to the isolated Ema polypeptide of the present invention. The antibody may be 
10 a monoclonal or polyclonal antibody. Further, the antibody may be labeled with a 
detectable marker that is either a radioactive, calorimetric, fluorescent, or a 
luminescent marker. The labeled antibody may be a polyclonal or monoclonal 
antibody. In one embodiment, the labeled antibody is a purified labeled antibody. 
Methods of labeling antibodies are well known in the art. 

15 

In a further aspect, the present invention provides a purified antibody to a bacterial 
Ema polypeptide, particularly a streptococcal Ema polypeptide. In a still further 
aspect, the present invention provides a purified antibody to a Group B sreptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. 

20 

Antibodies against the isolated polypeptides of the present invention include naturally 
raised and recombinantly prepared antibodies. These may include both polyclonal and 
monoclonal antibodies prepared by known genetic techniques, as well as bi-specific 
(chimeric) antibodies, and antibodies including other functionalities suiting them for 
25 diagnostic use. Such antibodies can be used in immunoassays to diagnose infection 
with a particular strain or species of bacteria. The antibodies can also be used for 
passive immunization to treat an infection with Group B streptococcal bacteria. These 
antibodies may also be suitable for modulating bacterial adherence and/or invasion 
including but not limited to acting as competitive agents. 

30 

The present invention provides a monoclonal antibody to a Group B streptococcal 
poypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. The 
invention thereby extends to an immortal cell line that produces a monoclonal antibody 
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to a Group B streptococcal poypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE. 

An antibody to an Ema polypeptide, particularly selected from EmaA, EmaB, EmaC. 
5 EmaD or EmaE, labeled with a detectable label is further provided. In particular 
embodiments, the label may selected from the group consisting of an enzyme, a 
chemical which fluoresces, and a radioactive element. 

The term "antibody" includes, by way of example, both naturally occurring and non- 
10 naturally occurring antibodies. Speqfically, the term "antibody" includes polyclonal 
and monoclonal antibodies, and fragments thereof Furthermore, the term "antibody" 
includes chimeric antibodies and wholly synthetic antibodies, and fragments thereof 
Such antibodies include but are not limited to polyclonal, monoclonal, chimeric, single 
chain, Fab fragments, and an Fab expression library. 

15 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, that 
binds a specific epitope. The term encompasses polyclonal, monoclonal, and chimeric 
antibodies, the last mentioned described in further detail in U.S. Patent Nos. 4,816,397 
and 4,816,567. 

20 

An "antibody combining site" is that structural portion of an antibody molecule 
comprised of heavy and light chain variable and hypervariable regions that specifically 
binds antigen. 

25 The phrase "antibody molecule" in its various grammatical forms as used herein 

contemplates both an intact immunoglobulin molecule and an immunologically active 
portion of an immunoglobulin molecule. 

Exemplary antibody molecules sire intact immunoglobulin molecules, substantially 
30 intact immunoglobulin molecules and those portions of an immunoglobulin molecule 
that contains the paratope, including those portions known in the art as Fab, Fab', 
F(ab')2 ^i^d F(v), which portions are preferred for use in the therapeutic methods 
described herein. 
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Fab and F(ab*)2 portions of antibody molecules are prepared by the proteol5rtic reaction 
of papain and pepsin, respectively, on substantially intact antibody molecules by 
methods that are well-known. See for example, U.S. Patent No. 4,342,566 to 
Theofilopolous et al. Fab' antibody molecule portions are also well-known and are 
5 produced from F(ab')2 portions followed by reduction of the disulfide bonds linking the 
two heavy chain portions as with mercaptoethanol, and followed by alkylation of the 
resulting protein mercaptan with a reagent such as iodoacetamide. An antibody 
containing intact antibody molecules is preferred herein. 

The phrase "monoclonal antibody" ip its various granmiatical forms refers to an 
antibody having only one species of antibody combining site capable of 
immunoreacting with a particular antigen. A monoclonal antibody thus typically 
displays a single binding affinity for any antigen with which it immunoreacts. A 
monoclonal antibody may therefore contain an antibody molecule having a plurality of 
antibody combining sites, each immunospecific for a different antigen; e.g., a bispecific 
(chimeric) monoclonal antibody. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to polypeptide or derivatives or analogs thereof (^-e^, e.g,^ Antibodies ~ A 
20 Laboratory Manual, Harlow and Lane, eds.. Cold Spring Harbor Laboratory Press: 
Cold Spring Harbor, New York, 1988). For the production of antibody, various host 
animals can be immunized by injection with the Group B streptococcal Ema 
polypeptide, an immunogenic fragment thereof, or a derivative {e.g,^ fragment or 
fusion protein) thereof, including but not limited to rabbits, mice, rats, sheep, goats, 
25 etc. In one embodiment, the polypeptide can be conjugated to an immunogenic 
carrier, e.g., bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). 
Various adjuvant may be used to increase the immunological response, depending on 
the host species. 

30 For preparation of monoclonal antibodies, or fi-agment, analog, or derivative thereof, 
any technique that provides for the production of antibody molecules by continuous 

cell lines in culture may be used (see, e,g^ Antibodies — ^ Laboratory Manual^ 
Harlow and Lane, eds.. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 



10 



15 
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New York, 1988). These include but are not limited to the hybridoma technique 
originally developed by Kohler and Milstein (1975, Nature 256:495-497), as well as 
the trioma technique, the human B-cell hybridoma technique (Kozbor et al, 1983, 
Immunology Today 4:72), and the EBV-hybridoma technique to produce human 
5 monoclonal antibodies (Cole et al., 1985, m Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Monoclonal antibodies can be produced in 
germ-free animals utilizing recent technology (PCT/US90/02545). Human antibodies 
may be used and can be obtained by using human hybridomas (Cote et al., 1983, Proc. 
Natl. Acad. Sci. U.S.A. 80:2026-2030) or by transforming human B cells with EBV 

10 virus in vitro (Cole et al., 1985, in hfonoclonal Antibodies and Cancer Therapy^ Alan 
R. Liss, pp. 77-96). In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" (Morrison e/ a/., 1984,7. Bacteriol 159-870; 
Neuberger et al, 1984, Nature 3 12:604-608; Takeda et al, 1985, Nature 3 14:452- 
454) by splicing the genes from a mouse antibody molecule specific for a polypeptide 

15 together with genes from a human antibody molecule of appropriate biological activity 
can be used; such antibodies are within the scope of this invention. Such human or 
humanized chimeric antibodies are preferred for use in therapy of human infections or 
diseases, since the human or humanized antibodies are much less likely than xenogenic 
antibodies to induce an immune response, in particular an allergic response, 

20 themselves. An additional embodiment of the invention utilizes the techniques 

described for the construction of Fab expression libraries (Huse et aL, 1989, Science 
246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments 
with the desired specificity for the polypeptide, or its derivatives, or analogs. 

25 Antibody fragments which contain the idiotype of the antibody molecule can be 
generated by known techniques. For example, such fragments include but are not 
limited to: the F(ab*)2 fragment which can be produced by pepsin digestion of the 
antibody molecule; the Fab* firagments which can be generated by reducing the 
disulfide bridges of the F(ab')2 firagment, and the Fab fragments which can be 

30 generated by treating the antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody can be • 
accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA 
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(enzyme-linked immunosorbant assay), "sandwich" immunoassays, immunoradiometric 
assays, gel diffusion precipitin reactions, immunodifiusion assays, in situ 
immunoassays (using colloidal gold, enzyme or radioisotope labels, for example), 
western blots, precipitation reactions, agglutination assays {e.g., gel agglutination 
5 assays, hemagglutination assays), complement fixation assays, immunofluorescence 
assays, protein A assays, and Immunoelectrophoresis assays, etc. In one embodiment, 
antibody binding is detected by detecting a label on the primary antibody. In another 
embodiment, the primary antibody is detected by detecting binding of a secondary 
antibody or reagent to the primary antibody. In a further embodiment, the secondary 
10 antibody is labeled. Many means ar§ known in the art for detecting binding in an 
immunoassay and are within the scope of the present invention. 

Antibodies can be labeled for detection in vitro, e.g., with labels such as enzymes, 
fluorophores, chromophores, radioisotopes, dyes, colloidal gold, latex particles, and 
15 chemiluminescent agents. Alternatively, the antibodies can be labeled for detection in 
vivo, e.g., with radioisotopes (preferably technetium or iodine); magnetic resonance 
shift reagents (such as gadolinium and manganese); or radio-opaque reagents. 

The labels most commonly employed for these studies are radioactive elements, 
20 enzymes, chemicals which fluoresce when exposed to ultraviolet light, and others. A 
number of fluorescent materials are known and can be utilized as labels. These 
include, for example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and 
Lucifer Yellow. A particular detecting material is anti-rabbit antibody prepared in 
goats and conjugated with fluorescein through an isothiocyanate. The polypeptide can 
25 also be labeled with a radioactive element or with an enzyme. The radioactive label 
can be detected by any of the currently available counting procedures. The preferred 
isotope may be selected from ^H, ''C, ^% ^'Cl, ^^Cr, ^^Co, ^«Co, ^^e, ^°Y, 
and ^«'Re. 

30 Enzyme labels are likewise useful, and can be detected by any of the presently utilized 
calorimetric, spectrophotometric, fluorospectrophotometric, amperometric or 
gasometric techniques. The enzyme is conjugated to the selected particle»by reaction 
with bridging molecules such as carbodiimides, diisocyanates, glutaraldehyde and the 
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like. Many enzymes which can be used in these procedures are known and can be 
utilized. The preferred are peroxidase, B-glucuronidase, B-D-glucosidase, 

B-D-galactosidase, urease, glucose oxidase plus peroxidase and alkaline phosphatase. 
U.S. Patent Nos. 3,654,090; 3,850,752; and 4,016,043 are referred to by way of 
5 example for their disclosure of alternate labeling material and methods. 

Diagnostic Applications 

10 The present invention also relates to a variety of diagnostic applications, including 
methods for identifying or monitoring streptococcal infections. The present invention 
also relates to a variety of diagnostic applications, including methods for identifying or 
monitoring Group B streptococcal infections. The present invention further relates to 
diagnostic applications or methods utilizing the polypeptides of the present invention, 

15 immunogenically recognized fragments thereof, or antibodies thereto. Such methods 
include the analysis and evaluation of agents, analogs or compounds which modulate 
the activity of the Ema polypeptides. The Ema polypeptides may also be utilized in 
diagnostic methods and assays for monitoring and determining immunological 
response and antibody response upon streptococcal infection or vaccination. 

20 

As described in detail above, antibody(ies) to the Ema polypeptides or fragments 
thereof can be produced and isolated by standard methods including the well known 
hybridoma techniques. For convenience, the antibody(ies) to the Ema polypeptides 
will be referred to herein as Abi and antibody(ies) raised in another species as Abj. 

25 

The presence of streptococci in cells can be ascertained by the usual immunological 
procedures applicable to such determinations. A number of useful procedures are 
known. Procedures which are especially useful utilize either the Ema polypeptides 
labeled with a detectable label, antibody against the Ema polypeptides labeled with a 
30 detectable label, or secondary antibody labeled with a detectable label. 

The procedures and their application are all familiar to those skilled in the. art and 
accordingly may be utilized within the scope of the present invention. The 
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"competitive" procedure, is described in U.S. Patent Nos. 3,654,090 and 3,850,752. 
The "sandwich" procedure, is described in U.S. Patent Nos. RE 3 1,006 and 4,016,043. 
Still other procedures are known such as the "double antibody," or "DASP" 
procedure. 

5 

In each instance, the Ema polypeptides forms complexes with one or more 
antibody(ies) or binding partners and one member of the complex is labeled with a 
detectable label. The fact that a complex has formed and, if desired, the amount 
thereof, can be determined by known methods applicable to the detection of labels. 

10 

In a further embodiment of this invention, commercial test kits suitable for use by a 
medical specialist may be prepared to determine the presence or absence of 
stretococci, particularly of streptococci expressing one or more Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. In as much as the 

15 ema locus, as described herein, is found in the genomic DNA of many, if not all, 
serotypes of Group B streptococci, it is a useful general marker for Group B 
streptococci. In as much as Ema homologs exist in other species of streptococci, 
including Group A and S. pneumoniae, it is a useful general marker for streptococci. 
Therefore, commercial test kits for determining the presence or absence of 

20 streptococci, and thereby determining whether an individual is infected with 

streptococci are contemplated and provided by this invention. Therefore, commercial 
test kits for determining the presence or absence of Group B streptococci, and thereby 
determining whether an individual is infected with Group B streptococci are 
contemplated and provided by this invention. 

25 

The present invention includes methods for determining and monitoring infection by 
streptococci by detecting the presence of a streptococcal polypeptide selected from the 
group of EmaA, EmaB, EmaC, EmaD and EmaE. In a particular such method, the 
streptococcal Ema polypeptide is measured by: 
30 a. contacting a sample in which the presence or activity of a Streptococcal 

polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE is suspected with an antibody to the said streptococcal 
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polypeptide under conditions that allow binding of the streptococcal 
polypeptide to the antibody to occur; and 
b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 
5 wherein the detection of binding indicates the presence or activity of the streptococcal 
polypeptide in the sample. 



The present invention includes methods for determining and monitoring infection by 
10 Group B streptococci by detecting the presence of a Group B streptococcal 

polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE. In a 
particular such method, the streptococcal Ema polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a Group B 
Streptococcal polypeptide selected from the group of EmaA, EmaB, 

15 EmaC, EmaD and EmaE is suspected with an antibody to the said 

Group B streptococcal polypeptide under conditions that allow binding 
of the Group B streptococcal polypeptide to the antibody to occur; and 

b. detecting whether binding has occurred between the Group B 
20 streptococcal polypeptide from the sample and the antibody; 

wherein the detection of binding indicates the presence or activity of the a Group B 
streptococcal polypeptide in the sample. 

The present invention further provides a method for detecting the presence of a 
25 bacterium having a gene encoding a Group B polypeptide selected from the group of 
emoA, emaB, emaC, emaD and emaE, comprising: 

a. contacting a sample in which the presence or activity of the bacterium 
is suspected with an oligonucleotide which hybridizes to a Group B 
streptococcal polypeptide gene selected from the group ofemoA, 

30 emaB, emaQ emaD and emaE, under conditions that allow specific 

hybridization of the oligonucleotide to the gene to occur; and 

b. detecting whether hybridization has occurred between the • 
oligonucleotide and the gene; 
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wherein the detection of hybridization indicates that presence or activity of the 
bacterium in the sample. 

The invention includes an assay system for screening of potential compounds effective 
5 to modulate the activity of a bacterial Ema protein of the present invention. In one 
instance, the test compound, or an extract containing the compound, could be 
administered to a cellular sample expressing the particular Ema protein to determine 
the compound's effect upon the activity of the protein by comparison with a control 
In a further instance the test compound, or an extract containing the compound, could 
10 be administered to a cellular sample expressing the Ema protein to determine the 
compound's effect upon the activity of the protein, and thereby on adherence of said 
cellular sample to host cells, by comparison with a control. 

Accordingly, a test kit may be prepared for the demonstration of the presence of Ema 
15 polypeptide or Ema activity in cells, comprising: 

(a) a predetermined amount of at least one labeled immunochemically reactive 
component obtained by the direct or indirect attachment of the Ema polypeptide or a 
specific binding partner thereto, to a detectable label; 

(b) other reagents; and 

20 (c) directions for use of said kit. 

More specifically, the diagnostic test kit may comprise: 

(a) a known amount of the Ema polypeptide as described above (or a binding 
partner) generally bound to a solid phase to form an immunosorbent, or in the 

25 alternative, bound to a suitable tag, or plural such end products, etc. (or their binding 
partners) one of each; 

(b) if necessary, other reagents; and 

(c) directions for use of said test kit. 

30 In a further variation, the test kit may be prepared and used for the purposes stated 
above, which operates according to a predetermined protocol (e.g. "competitive," 
"sandwich," "double antibody," etc.), and comprises: 
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(a) a labeled component which has been obtained by coupling the Ema polypeptide 
to a detectable label; 

(b) one or more additional immunochemical reagents of which at least one reagent 
is a ligand or an immobilized ligand, which ligand is selected from the group consisting 

5 of: 

(i) a ligand capable of binding with the labeled component (a); 

(ii) a ligand capable of binding with a binding partner of the labeled 
component (a); 

(iii) a ligand capable of binding with at least one of the component(s) to be 
10 determined; and 

(iv) a ligand capable of binding with at least one of the binding partners of 
at least one of the component(s) to be determined; and 

(c) directions for the performance of a protocol for the detection and/or 
determination of one or more components of an immunochemical reaction between the 

15 Ema polypeptide and a specific binding partner thereto. 

In accordance with the above, an assay system for screening potential drugs effective 
to modulate the activity of the Ema polypeptide may be prepared. The Ema 
polypeptide may be introduced into a test system, and the prospective drug may also 
20 be introduced into the resulting cell culture, and the culture thereafter examined to 
observe any changes in the Ema polypeptide activity of the cells, due either to the 
addition of the prospective drag alone, or due to the effect of added quantities of the 
known Ema polypeptide. 

25 Therapeutic Applications 

The therapeutic possibilities that are raised by the existence of the Group B 
streptococcal Ema polypeptides EmaA, EmaB, EmaC, EmaD and EmaE derive from 
the fact that the Ema polypeptides of the present invention are found generally in 
30 various serot5^es of Group B streptococci. In addition, broader therapeutic 

possibilities that are raised by the existence of Ema homologous polypeptides in 
various distinct species of streptococci, including S. pneumoniae and S, pyogenes. In 
addition Ema homologous polypeptides have been identified in E. faecalis and C 
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diptheriae. Of particular relevance to their suitability in vaccine and immunological 
therapy is that the Ema A, EmaB, and EmaC polypeptides possess N-terminal 
sequences consistent with a signal peptide, indicating secretion from the bacterial cell 
and at least partial extracellular localization. In addition, the EmaA, EmaB, EmaC, 
5 EmaD and EmaE polypeptides demonstrate homology to distinct bacterial proteins 
involved in or implicated in bacterial adhesion and invasion. Thus, the Ema 
polypeptides are anticipated to be involved in or required for streptococcal adhesion to 
and/or invasion of cells, critical for bacterial survival and virulence in the human host, 

1 0 Modulators of Extracellular Matrix Adhesin Protein 

Thus, in instances where it is desired to reduce or inhibit the effects resulting from the 
extracellular matrix adhesin protein Ema of the present invention, an appropriate 
inhibitor of one or more of the Ema proteins, particularly EmaA, EmaB, EmaC, EmaD 
15 and EmaE could be introduced to block the activity of one or more Ema protein. 

The present invention contemplates screens for a modulator of an Ema polypeptide, in 
particular modulating adhesion or invasion facilitated by EmaA, EmaB, EmaC, EmaD 
or EmaE. In one such embodiment, an expression vector containing the Ema 

20 polypeptide of the present invention, or a derivative or analog thereof, is placed into a 
cell in the presence of at least one agent suspected of exhibiting Ema polypeptide 
modulator activity. The cell is preferably a bacterial cell, most preferably a 
streptococcal cell, or a bacterial host cell. The amount of adhesion or binding activity 
is determined and any such agent is identified as a modulator when the amount of 

25 adhesion or binding activity in the presence of such agent is different than in its 

absence. The vectors may be introduced by any of the methods described above. In a 
related embodiment the GBS Ema polypeptide is expressed in streptococci and the 
step of determining the amount of adhesion or binding activity is performed by 
determining the amount of binding to bacterial host cells cells in vitro, 

30 

When the amount of adhesion or binding activity in the presence of the modulator is 
greater than in its absence, the modulator is identified as an agonist or activator of the 
Ema polypeptide, whereas when the amount of adhesion binding activity in the 
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presence of the modulator is less than in its absence, the modulator is identified as an 
antagonist or inhibitor of the Ema polypeptide. As any person having skill in the art 
would recognize, such determinations as these and those below could require some 
form of statistical analysis, which is well within the skill in the art. 

5 

Natural effectors found in cells expressing Ema polypeptide can be fractionated and 
tested using standard effector assays as exemplified herein, for example. Thus an 
agent that is identified can be a naturally occurring adhesion or binding modulator. 
Alternatively, natural products libraries can be screened using the assays of the present 
10 invention for screening such agents.^ 

Another approach uses recombinant bacteriophage to produce large libraries. Using 
the ''phage method" [Scott and Smith, 1990, Science 249:386-390 (1990); Cwirla, et 
al., Proc. Natl Acad ScL, 87:6378-6382 (1990); Devlin et aL, Science, 249:404-406 

15 (1990)], very large libraries can be constructed (10^-10^ chemical entities). Yet 
another approach uses primarily chemical methods, of which the Geysen method 
[Geysen et al.. Molecular Immunology 23:709-715 (1986); Geysen et al. J. 
Immunologic Method 102:259-274 (1987)] and the method of Fodor et al. {Science 
251:767-773 (1991)] are examples, Furka et al. [14th International Congress of 

20 Biochemistry, Volume 5, Abstract FR:013 (1988); Furka, Int. J, Peptide Protein Res. 
37:487-493 (1991)], Houghton [U.S. Patent No. 4,631,211, issued December 1986] 
and Rutter et al. [U.S. Patent No. 5,010,175, issued April 23, 1991] describe methods 
to produce a mixture of peptides that can be tested. 

25 In another aspect, synthetic libraries [Needels et al., Proc. Natl. Acad. Set USA 
90:10700-4 (1993); Ohlmeyer et al, Proc. Natl. Acad. Set USA 90:10922-10926 
(1993); Lam et al. International Patent PubUcation No. WO 92/00252; Kocis et al.. 
International Patent Publication No. WO 9428028, each of which is incorporated 
herein by reference in its entirety], and the like can be used to screen for such an agent. 

30 

This invention provides antagonist or blocking agents which include but are not limited 
to: peptide fragments, mimetic, a nucleic acid molecule, a ribozyme, a polypeptide, a 
small molecule, a carbohydrate molecule, a monosaccharide, an oligosaccharide or an 
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antibody. Also, agents which competitively block or inhibit streptococcal bacterium 
are contemplated by this invention. This invention provides an agent which comprises 
an inorganic compound, a nucleic acid molecule, an oligonucleotide, an organic 
compound, a peptide, a peptidomimetic compound, or a protein which inhibits the 
5 polypeptide. 

Vaccines 

10 In a further aspect, the present invention extends to vaccines based on the Ema 

proteins described herein. The present invention provides a vaccine comprising one or 
more Group B streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, and a pharmaceutical^ acceptable adjuvant. The present 
invention provides a vaccine comprising one or more bacterial Ema polypeptide 

15 selected from the group of polypeptides comprising the amino acid sequence set out in 
any of SEQ ID NO: 23, 26, 29, 32 and 37, and a pharmaceutically acceptable 
adjuvant. 

The present invention further provides a vaccine comprismg one or more Group B 
20 streptococcal polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, further comprising one or more additional GBS antigen. The present 
invention further provides a vaccine comprising one or more Group B streptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
further comprising one or more antigens selected from the group of the polypeptide 
25 Spbl or an immunogenic fragment thereof, the polypeptide Spb2 or an immunogenic 
fragment thereof, C protein alpha antigen or an immunogenic fragment thereof. Rib or 
an immunogenic fragment thereof, Lmb or an immunogenic fragment thereof, C5a-ase 
or an immunogenic fragment thereof, and Group B streptococcal polysaccharides or 
oUgosaccharides. 

30 

In another aspect, the invention is directed to a vaccine for protection of an animal 
subject from infection with streptococci comprising an immunogenic amount of one or 
more streptococcal Ema polypeptide, or a derivative or fragment thereof The Ema 



wo 02/12294 



PCT/USOl/24795 



65 

polypeptide may be particularly selected from the group of EmaA, EmaB, EmaC, 
EmaD or EmaE, or a derivative or fragment thereof. In a further aspect, the invention 
is directed to a vaccine for protection of an animal subject from infection with 
streptococci comprising an immunogenic amount of one or more Ema polypeptide 
5 EmaA, EmaB, EmaC, EmaD or EmaE, or a derivative or fragment thereof In a 

further aspect, the invention is directed to a vaccine for protection of an animal subject 
from infection with GBS comprising an immunogenic amount of one or more Ema 
polypeptide EmaA, EmaB, EmaC, EmaD or EmaE, or a derivative or fragment 
thereof Such a vaccine may contain the protein conjugated covalently to a 
10 streptococcal or GBS bacterial polysaccharide or oligosaccharide or polysaccharide or 
oligosaccharide from one or more streptococcal or GBS serotypes. 

This invention provides a vaccine which comprises a polypeptide bacterial Ema 
protein and a pharmaceutical^ acceptable adjuvant or carrier. In particular, a vaccine 

15 is provided which comprises one or more Ema polypeptides selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE. This invention provides a vaccine which 
comprises a combination of at least one bacterial Ema protein selected from the group 
of EmaA, EmaB, EmaC, EmaD and EmaE and at least one other Group B 
streptococcal protein particularly Spbl and/or Spb2 and/or C protein alpha antigen, 

20 and a pharmaceutically acceptable adjuvant or carrier. The Ema polypeptide may 
comprise an amino acid sequence of a Ema protein EmaA, EmaB, EmaC, EmaD, 
EmaE as set forth in FIGURES 2-6 and SEQ ID NOS: 2, 4, 6, 8 and 10. 

This invention further provides a vaccine comprising an isolated nucleic acid encoding 
25 a bacterial Ema polypeptide and a pharmaceutically acceptable adjuvant or carrier. 
This invention further provides a vaccine comprising an isolated nucleic acid encoding 
a streptococcal Ema polypeptide and a pharmaceutically acceptable adjuvant or 
carrier. This invention further provides a vaccine comprising an isolated nucleic acid 
encoding a GBS Ema polypeptide and a pharmaceuticdly acceptable adjuvant or 
30 carrier. This invention further provides a vaccine comprising isolated nucleic acid 
encoding one or more GBS Ema polypeptide, particularly selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE and a pharmaceutically acceptable adjuvant 
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or carrier. The nucleic acid may comprise a nucleic acid sequence of a GBS Ema 
polypeptide as set forth in any of SEQ ID NOS:l, 3, 5, 7, or 9. 

Active immunity against streptococci can be induced by immunization (vaccination) 
5 with an immunogenic amount of the polypeptide, or peptide derivative or fragment 
thereof, and an adjuvant, wherein the polypeptide, or antigenic derivative or fragment 
thereof, is the antigenic component of the vaccine. The polypeptide, or antigenic 
derivative or fragment thereof, may be one antigenic component, in the presence of 
other antigenic components in a vaccine. For instance, the polypeptide of the present 

10 invention may be combined with oti\er known streptococcal polypeptides or 

poty/oligosaccharides, or immunogenic fragments thereof, including for instance GBS 
capsular polysaccharide, Spbl, Spb2, C protein alpha antigen, Rib, Lmb, and C5a-ase 
in a multi-component vaccine. Such multi-component vaccine may be utilized to 
enhance immune response, even in cases where the polypeptide of the present 

15 invention elicits a response on its own. The polypeptide of the present invention may 
also be combined with existing vaccines, whole bacterial or capsule-based vaccines, 
alone or in combination with other GBS polypeptides, particularly Spbl and/or Spb2 
and/or C proteni alpha antigen and/or Rib to enhance such existing vaccines. 

20 The term "adjuvant" refers to a compound or mixture that enhances the immune 

response to an antigen. An adjuvant can serve as a tissue depot that slowly releases 
the antigen and also as a lymphoid system activator that non-specifically enhances the 
immune response (Hood et al. Immunology, Second Ed, 1984, Benjamin/Cummings: 
Menlo Park, California, p. 384). Often, a primary challenge with an antigen alone, in 

25 the absence of an adjuvant, will fail to elicit a humoral or cellular immune response. 
Adjuvant include, but are not limited to, complete Freund's adjuvant, incomplete 
Freund's adjuvant, saponin, mineral gels such as aluminum hydroxide, surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil or 
hydrocarbon emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially 

30 useful human adjuvant such as BCG {bacille Calmette-Gueriri) and Corynebacterium 
parvum. Preferably, the adjuvant is pharmaceutical^ acceptable. 



wo 02/12294 



PCT/USOl/24795 



67 

The invention further provides a vaccine which comprises a non-adherent, non-virulent 
mutant, including but not limited to the ema mutants herein described and 
contemplated. Medaglini et al (Madaglini et al (1 995) Proc Natl Acad Sci USA 
92;6868-6872) and Oggioni and Pozzi (Oggioni, M.R. and Pozzi, G. (1996) Gene 
5 1 69: 85-90) have previously described the use of Streptococcus gordonii, a commensal 
bacterium of the human oral cavity, as live vaccine delivery vehicles and for 
heterologous gene expression. Such ema' mutant can therefore be utilized as a 
vehicle for expression of immunogenic proteins for the purposes of eliciting an 
immune response to such other proteins in the context of vaccines. Active immunity 
10 against Group B streptococci, can bie induced by immunization (vaccination) with an 
immunogenic amount of the ema' vehicle expressing an immunogenic protein. Also 
contemplated by the present invention is the use of any such ema' mutant in 
expressing a therapeutic protein in the host in the context of other forms of therapy. 

The polypeptide of the present invention, or fragments thereof, can be prepared in an 
admixture with an adjuvant to prepare a vaccine. Preferably, the polypeptide or 
peptide derivative or fragment thereof, used as the antigenic component of the 
vaccine is an antigen common to all or many serotypes of GBS bacteria, or common to 
closely related species of bacteria, for instance Streptococcus. 

Vectors containing the nucleic acid-based vaccine of the invention can be introduced 
into the desired host by methods known in the art, e,g., transfection, eiectroporation, 
micro injection, transduction, cell fusion, DEAE dextran, calcium phosphate 
precipitation, lipofection (lysosome fusion), use of a gene gun, or a DNA vector 
transporter (see, e,g, Wu et al, 1992, J, Biol Chem. 267:963-967; Wu and Wu, 
1988, J, Biol Chem, 263:14621-14624; Hartmut et al, Canadian Patent Application 
No. 2,012,311, filed March 15, 1990). 

The modes of administration of the vaccine or compositions of the present invention 
30 may comprise the use of any suitable means and/or methods for delivering the vaccine 
or composition to the host animal whereby they are immumostimulatively effective. 
Delivery modes may include, without limitation, parenteral administration- methods, 
such as paracancerally, transmucosally, transdermally, intramuscularly, intravenously. 



15 



20 



25 
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intradermally, subcutaneously, iatraperitonealy, intraventricularly, intracranially and 
intratumorally. Preferably, since the desired result of vaccination is to elucidate an 
immune response to the antigen, and thereby to the pathogenic organism, 
administration directly, or by targeting or choice of a viral vector, indirectly, to 
5 lymphoid tissues, e.g., lymph nodes or spleen, is desirable. Since immune cells are 
continually replicating, they are ideal target for retroviral vector-based'nucleic acid 
vaccines, since retroviruses require replicating cells. These vaccines and compositions 
can be used to immunize mammals, for example, by the intramuscular or parenteral 
routes, or by delivery to mucosal surfaces using microparticles, capsules, liposomes 

10 and targeting molecules, such as toxjns and antibodies. The vaccines and 

immunogenic compositions may be administered to mucosal surfaces by, for example, 
the nasal or oral (intragastric) routes. Alternatively, other modes of administration 
including suppositories may be desirable. For suppositories, binders and carriers may 
include, for example, polyalkylene glycols and triglycerides. Oral formulations may 

15 include normally employed incipients, such as pharmaceutical grades of saccharine, 
cellulose and magnesium carbonate. 

These compositions may take the form of solutions, suspensions, tablets, pills, 
capsules, sustained release formulations or powders and contain 1 to 95% of the 

20 immunogenic compositions of the present invention. The immunogenic compositions 
are administered in a manner compatible with the dosage formulation, and in such 
amount as to be therapeutically effective, protective and immunogenic. The quantity to 
be administered depends on the subject to the immunized, including, for example, the 
capacity of the subject's immune system to synthesize antibodies, and if needed, to 

25 produce a cell-mediated, humoral or antibody-mediated immune response. Precise 
amounts of antigen and immunogenic composition to be administered depend on the 
judgement of the practitioner. However, suitable dosage ranges are readily 
determinable by those skilled in the art and may be of the order of micrograms to 
milligrams. Suitable regimes for initial administration and booster doses are also 

30 variable, but may include an initial administration followed by subsequent 
administrations. The dosage of the vaccine may also depend on the route of 
administration and will vary according to the size of the host. 
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Passive immunity can be conferred to an animal subject suspected of suflfering an 
infection with streptococci by administering antiserum, polyclonal antibodies, or a 
neutralizing monoclonal antibody against one or more Ema polypeptide of the 
invention to the patient. A combination of antibodies directed against one or more 
5 Ema polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
in combination with one or more of antibodies against Spbl, Spb2, Rib and C protein 
alpha antigen is also contemplated by the present invention. Although passive 
immunity does not confer long term protection, it can be a valuable tool for the 
treatment of a bacterial infection in a subject who has not been vaccinated. Passive 

10 immunity is particularly important fqr the treatment of antibiotic resistaat strains of 
bacteria, since no other therapy may be available. Preferably, the antibodies 
administered for passive immune therapy are autologous antibodies. For example, if 
the subject is a human, preferably the antibodies are of human origin or have been 
"humanized," in order to minimize the possibility of an immune response against the 

15 antibodies. The active or passive vaccines of the invention can be used to protect an 
animal subject from infection by streptococcus, particularly Group B streptococcus. 

Vaccines for GBS have been previously generated and tested. Preliminary vaccines 
used unconjuated purified polysaccaride. GBS polysaccharides and oligosaccharides 

20 are poorly immunogenic and fail to elicit significant memory and booster responses. 
Baker et al immunized 40 pregnant women with purified serotype HI capsular 
polysaccharide (Baker, CJ. et al. (199S) New EnglJ of Med 319:11S0A135). 
Overall, only 57% of women with low levels of specific antibody responded to the the 
vaccine. The poor immunogenicity of purified polysaccharide antigen was fiarther 

25 demonstrated in a study in which thirty adult volunteers were immunized with a 

tetravalent vaccine composed of purified polysaccharide from serotypes la, lb, II, and 
III (Kotloff, K.L. et al. (1996) Vaccine 14:446-450). Although safe, this vaccine was 
only modestly immunogenic, with only 13% of subjects responding to type lb, 17% to 
type II, 33% responding to type la, and 70%responding to type III polysaccharide. 

30 The poor immunogenicity of polysaccaride antigens prompted efforts to develop 

polysaccharide conjugate vaccines, whereby these polysaccharides or oligosaccharides 
are conjugated to protein carriers. Ninety percent of healthy adult women immunized 
with a type III polysaccharide-tetanus toxoid conjugate vaccine responded with a 
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4- fold rise in antibody concentration, compared to 50% immunized with plain 
polysaccharide (Kasper, D.L. et al (1996) J of Clin Invest 98:2308-23 14). A type 
la/Ib polysaccharide-tetanus toxoid conjugate vaccine was similarly more 
immunogenic in healthy adults than plain polysaccharide (Baker, C.J. et al (1999) J 

5, Infect Dis 119:142-150). 

The general method for the conjugation of polysaccharide is described in Wessels et al 
(Wessels, M.R. et al (1990) J. Clin Investigation 86: 1428-1433). Prior to coupling 
with tetanus toxoid, aldehyde groups are introduced on the polysaccharide by 
10 controlled periodate oxidation, resulting m the conversion of a portion of the sialic 
acid residues of the polysaccharide to residues of the 8-carbon analogue of sialic acid, 

5- acetamido-3,5-dideoxy-D-galactosyloctulosonic acid. Tetanus toxoid is conjugated 
to the polysaccharide by reductive amination using free aldehyde groups present on the 
partially oxidized sialic acid residues. The preparation and conjugation of 

15 oligosaccharides is described in Paoletti et al (Paoletti, L.C. et al (1990) J. Biol Chem 
265: 18278-18283). Purified capsular polysaccharide is depolymerized by enzymatic 
digestion using endo-beta-galactosidase produced by Citrobacter freundii. Following 
digestion, oligosaccharides are fractionated by gel filtration chromatography. Tetanus 
toxoid was covalentiy coupled via a syntiietic spacer molecule to tiie reducing end of 

20 the oligosaccharide by reductive amination. 

Methods and vaccines comprising GBS conjugate vaccines, comprising capsular 
polysaccharide and protein are provided and described in U.S. Patent 5, 993,825, 
5,843,461, 5,795,580, 5,302,386 and 4,356,263, which are incorporated herein by 
25 reference in their entirety. These conjugate vaccines include polysaccharide-tetanus 
toxoid conjugate vaccines. 

One polypeptide proposed to be utilized in a GBS vaccine is the repetitive GBS 
C protein alpha antigen, which contains up to nine tandemly repeated units of 82 
30 amino acids (Michel, J.K. et al (1992) PNAS USA 89: 10060-10064). The 

polypeptide, methods and vaccines thereof, including polysaccharide-conjugate 
vaccines generated therewith, are provided and described in U.S. Patent 5,968,521, 
5,908,629, 5,858,362, 5,847,081, 5,843,461, 5,843,444, 5,820,860, and 5,648,241, 
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which are herein incorporated by reference in their entirety. Antibodies generated 
against C protein alpha antigen with a large numbers of repeats protect against 
infection, but GBS are able to change the structure of the protein by deleting one or 
more of the repeat regions and escape detection by these antibodies (Madoflf, L.C. et 
5 al (1996) PNAS USA 93: 4131-4136). This effect could theoretically be prevented by 
immunization with a protein with a lower number of repeat units, but the 
immunogenicity of the C protein alpha antigen is inversely related to the nimiber of 
repeats - 65% of mice responded to immunization with the 9-repeat protein, but only 
1 1% to a 1-repeat protein (Gravekamp, C. et al (1997) Infect Immunity 65: 5216- 
10 5221). This is a disadvantage with ^y protein with a repetitive structure - it is 
common for bacteria to be able to alter or reassort these genes to alter the proteins 
exposed on their surface. 

Typical doses for a vaccine composed of a protein antigen are in the range of 2.5-50 
15 ug of total protein per dose. Typical doses for a polysaccharide-protein conjugate 
vaccine are 7.5-25 ug of polysaccharide and 1.25-250 ug of carrier protein. These 
types of vaccines are almost always given intramuscularly. Dosing schedules of a 
vaccine can be readily determined by the skilled artisan, particularly by comparison of 
similar vaccines, including other GBS vaccines. If used as a universal vaccine, a GBS 
20 vaccine would be integrated into the routine immunization schedule. Most similar 
vaccines require a primary series of immunizations (usually 2 or 3 doses at 2 month 
intervals beginning at lor 2 months of age) and a single booster at 12-18 months of 
age. A smaller number of doses or a single dose may be adequate in older children 
(over a year of age). For immunization of pregnant women, an exemplary 
25 immunization schedule would be a single dose given in the second or early third 

trimester. For immunization of non-pregnant adults, a single dose would probably be 
used. The requirement for subsequent booster doses in adults is diflBcult to predict - 
this would be based on the immunogenicity of the vaccine and ongoing surveillance of 
vaccine efficacy. 

30 

Immunogenic Compositions 



wo 02/12294 



PCT/USOl/24795 



72 

In a further aspect, the present invention provides an immunogenic composition 
comprising one of more bacterial Ema polypeptides. In a still further aspect, the 
present invention provides an immunogenic composition comprising one of more 
streptococcal Ema polypeptides. In a particular aspect, the present invention provides 
5 an immunogenic composition comprising one of more Group B streptococcal 

polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD, EmaE and a 
fragment thereof, and a pharmaceutically accpetable adjuvant. Immunogenic 
compositions may comprise a combination of one or more Group B Ema polypeptide, 
or an immunogenic polypeptide fragment thereof, with one or more additional GBS 
10 polypeptide or GBS capsular polysa.ccharide or oligosaccharide. 

The present invention further provides an immunogenic composition comprising one 
or more Group B streptococcal polypeptide selected from the group of EmaA, EmaB, 
EmaC, EmaD and EmaE, further comprising one or more antigens selected from the 
15 group of the polypeptide Spbl or an immunogenic fragment thereof, the polypeptide 
Spb2 or an immunogenic fragment thereof, C protein alpha antigen or an immunogenic 
fragment thereof. Rib or an immunogenic fragment thereof, and Group B 
streptococcal polysaccharides or oligosaccharides. 

20 Pharmaceutical Compositions 

The invention provides pharmaceutical compositions comprising a bacterial Ema 
polypeptide, particularly a streptococcal Ema polypeptide, and a pharmaceutically 
acceptable carrier. The invention provides pharmaceutical compositions comprising a 

25 Group B streptococcal polypeptide selected from the group of EmaA, EmaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier. The present invention 
further provides pharmaceutical compositions comprising one or more GBS Ema 
polypeptide, or a fragment thereof, in combination with one or more of GBS 
polypeptide Spbl, Spb2, C protein alpha antigen. Rib, a Group B streptococcal 

30 polysaccharide or oligosaccharide vaccine, and an anti-streptococcal vaccine. 

Such pharmaceutical composition for preventing streptococcal attachment to mucosal 
surface may include antibody to Ema polypeptide EmaA, EmaB, EmaC, EmaD or 
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EmaE or any combination of antibodies to one or more such Ema polypeptide. In 
addition, any such composition may further include antibody to GBS polypeptides 
Spbl, Spb2, C protein alpha antigen, or Rib. Blocking adherence using such antibody 
blocks the initial step in infection thereby reducing colonization. This in turn decreases 
5 person to person transmission and prevents development of symptomatic disease. 

The present invention provides a pharmaceutical composition comprising an antibody 
to a Group B streptococcal protein selected from the group of EmaA, EraaB, EmaC, 
EmaD and EmaE, and a pharmaceutically acceptable carrier. The invention further 
10 provides a pharmaceutical composition comprising a combination of at least two 
antibodies to Group B sixeptococcal proteins and a pharmaceutically acceptable 
carrier, wherein at least one antibody to a protein selected from the group of EmaA, 
EmaB, EmaC, EmaD, EmaE,is combined with at least one antibody to a protein 
selected from the group of Spbl, Spb2, Rib, and C protein alpha antigen. 

15 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of streptococci, 
so as to treat or prevent the adverse consequences of invasive, spontaneous, or 
idiopathic pathological states. 

20 

It is still a further object of the present invention to provide a method for the 
prevention or treatment of mammals to control the amount or activity of Group B 
streptococci, so as to treat or prevent the adverse consequences of invasive, 
spontaneous, or idiopathic pathological states. 

25 

The invention provides a method for preventing infection with a bacterium that 
expresses a streptococcal Ema polypeptide comprising administering an 
immunogenically effective dose of a vaccine comprising an Ema polypeptide selected 
from the group of EmaA, EmaB, EmaC, EmaD and EmaE to a subject. 

30 

The invention further provides a method for preventing infection with a bacterium that 
expresses a Group B streptococcal Ema polypeptide comprising administering an 
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immunogenicaliy effective dose of a vaccine comprising an Ema polypeptide selected 
from the group of EmaA, EmaB, EmaC, EmaD and EmaE to a subject. 

The present invention is directed to a method for treating infection with a bacterium 
5 that expresses a Group B streptococcal Ema polypeptide comprising administering a 
therapeutically effective dose of a pharmaceutical composition comprising an Ema 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a 
, pharmaceutically acceptable carrier to a subject. 

10 The invention ftirther provides a method for treating infection with a bacterium that 
expresses a Group B streptococcal Ema polypeptide comprising administering a 
therapeutically effective dose of a pharmaceutical composition comprising an antibody 
to an Ema polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, and a pharmaceutically acceptable carrier to a subject. 

15 

In a further aspect, the invention provides a method of inducing an immune response 
in a subject which has been exposed to or infected with a Group B streptococcal 
bacterium comprising administering to the subject an amount of the pharmaceutical 
composition comprising an Ema polypeptide selected from the group of EmaA, EmaB, 
20 EmaC, EmaD and EmaE, and a pharmaceutically acceptable carrier, thereby inducing 
an immune response. 

The invention still further provides a method for preventing infection by a 
streptococcal bacterium in a subject comprising administering to the subject an amount 
25 of a pharmaceutical composition comprising an antibody to an Ema polypeptide 
selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE and a 
pharmaceutically acceptable carrier or diluent, thereby preventing infection by a 
streptococcal bacterium. 

30 The invention ftirther provides an ema mutant bacteria which is non-adherent and/or 
non-invasive to cells and which is mutated in one or more genes selected from the 
group of emaA, emaB, emaC, emaD and emaE. Particularly, such ema mutant is a 
Group B streptococcal bacteria. Such non-adherent and/or non-invasive ema mutant 
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bacteria can further be utilized in expressing other immunogenic or therapeutic 
proteins for the purposes of eliciting immune responses to any such other proteins in 
the context of vaccines and in other forms of therapy. 

5 This invention provides a method of inhibiting colonization of host cells in a subject 
which has been exposed to or infected with a streptococcal bacterium comprising 
administering to the subject an amount of a pharmaceutical composition comprising an 
Ema polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD and EmaE, 
thereby inducing an immune response. The therapeutic peptide that blocks 
10 colonization is delivered by the respiratory mucosal. The pharmaceutical composition 
comprises the polypeptide selected from the group of SEQ ID NO: 2, 4, 6, 8 and 10. 

As used herein, "pharmaceutical composition" could mean therapeutically effective 
amounts of polypeptide products or antibodies of the invention together with suitable 

15 diluents, preservatives, solubilizers, emulsifiers, adjuvant and/or carriers useful in 
therapy against bacterial infection or in inducing an immune response. A 
"therapeutically effective amoimt" as used herein refers to that amount which provides 
a therapeutic effect for a given condition and administration regimen. Such 
compositions are liquids or lyophilized or otherwise dried formulations and include 

20 diluents of various buffer content (e.g., Tris-HCl., acetate, phosphate), pH and ionic 
strength, additives such as albumin or gelatin to prevent absorption to surfaces, 
detergents (e.g., Tween 20, Tween 80, Pluronic F68, bile acid salts), solubilizing 
agents (e.g., glycerol, polyethylene glycerol), anti-oxidants (e.g., ascorbic acid, sodium 
metabisulfite), preservatives (e.g., Thimerosal, benzyl alcohol, parabens), bulking 

25 substances or tonicity modifiers (e.g., lactose, mannitol), covalent attachment of 

polymers such as polyethylene glycol to the protein, complexation with metal ions, or 
incorporation of the material into or onto particulate preparations of polymeric 
compounds such as polylactic acid, polglycolic acid, hydrogels, etc, or onto liposomes, 
microemulsions, micelles, unilamellar or multilamellar vesicles, erythrocyte ghosts, or 

30 spheroplasts. Such compositions will influence the physical state, solubility, stability, 
rate of in vivo release, and rate of invivQ clearance of the polypeptides of the present 
invention. The choice of compositions will depend on the physical and chemical 
properties of the polypeptide. Controlled or sustained release compositions include 
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formulation in lipophilic depots (e.g., fatty acids, waxes, oils). Also comprehended by 
the invention are particulate compositions coated with polymers (e.g., poloxamers or 
poloxamines) and the polypeptides of the present invention coupled to antibodies 
directed against tissue-specific receptors, ligands or antigens or coupled to ligands of 
5 tissue-specific receptors. Other embodiments of the compositions of the invention 
incorporate particulate forms, protective coatings, protease inhibitors or permeation 
enhancers for various routes of administration, including parenteral, pulmonary, nasal 
and oral. 

10 Further, as used herein "pharmaceutically acceptable carrier" are well known to those 
skilled in the art and include, but are not limited to, 0.01-0. IM and preferably 0.05M 
phosphate buffer or 0.8% saline. Additionally, such pharmaceutically acceptable 
carriers may be aqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 

15 vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. Parenteral vehicles include sodium 
chloride solution. Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's or 
fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte 

20 replenishers such as those based on Ringer's dextrose, and the like. Preservatives and 
other additives may also be present, such as, for example, antimicrobials, antioxidants, 
collating agents, inert gases and the like. 

The phrase "pharmaceutically acceptable" refers to molecular entities and 
25 compositions that are physiologically tolerable and do not typically produce an allergic 
or similar untoward reaction, such as gastric upset, dizziness and the like, when 
administered to a human. 

The phrase "therapeutically effective amount" is used herein to mean an amount 
30 sufficient to prevent, and preferably reduce by at least about 30 percent, more 

preferably by at least 50 percent, most preferably by at least 90 percent, a clinically 
significant infection by streptococcal bacterium. Alternatively, in the case* of a vaccine 
or immunogenic composition, a therapeutically effective amount is used herein to 
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mean an amount sufficient and suitable to elicit an immune response and antibody 
response in an individual, and particularly to provide a response sufficient to prevent, 
and preferably reduce by at least about 30 percent, more preferably by at least 50 
percent, most preferably by at least 90 percent, a clinically significant infection by 
5 streptococcal bacterium. 

Controlled or sustained release compositions include formulation in lipophilic depots 
(e.g. fatty acids, waxes, oils). Also comprehended by the invention are particulate 
compositions coated with polymers (e.g. poloxamers or poloxamines) and the 
10 compound coupled to antibodies directed against tissue-specific receptors, ligands or 
antigens or coupled to ligands of tissue-specific receptors. Other embodiments of the 
compositions of the invention incorporate particulate forms protective coatings, 
protease inhibitors or permeation enhancers for various routes of administration, 
including parenteral, pulmonary, nasal and oral. 

15 

When administered, compounds are often cleared rapidly from mucosal surfaces or 
the circulation and may therefore elicit relatively short-lived pharmacological activity. 
Consequently, firequent administrations of relatively large doses of bioactive 
compounds may by required to sustain therapeutic efficacy. Compounds modified by 

20 the covalent attachment of water-soluble polymers such as polyethylene glycol, 

copolymers of polyethylene glycol and polypropylene glycol, carboxymethyl cellulose, 
dextran, polyvinyl alcohol, pol5rvinylpyrrolidone or polyproline are known to exhibit 
substantially longer half-lives in blood following intravenous injection than do the 
corresponding unmodified compounds (Abuchowski et al., 1981; Newmark et al., 

25 1982; and Katre et al., 1987). Such modifications may also increase the compound's 
solubility in aqueous solution, eliminate aggregation, enhance the physical and 
chemical stability of the compound, and greatly reduce the immunogenicity and 
reactivity of the compound. As a result, the desired in vivo biological activity may be 
achieved by the administration of such polymer-compound abducts less frequently or 

30 in lower doses than with the unmodified compound. 
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Dosages. The sufficient amount may include but is not limited to from about 1 |ag/kg 
to about 1000 mg/kg. The amount may be 10 mg/kg. The pharmaceutically 
acceptable form of the composition includes a pharmaceutically acceptable carrier. 

5 As noted above, the present invention provides therapeutic compositions comprising 
pharmaceutical compositions comprising vectors, vaccines, polypeptides, nucleic acids 
and antibodies, anti-antibodies, and agents, to compete with the Group B 
streptococcus bacterium for pathogenic activities, such as adherence to host cells. 

10 The preparation of therapeutic compositions which contain an active component is 
well understood in the art. Typically, such compositions are prepared as an aerosol of 
the polypeptide delivered to the nasopharynx or as injectables, either as liquid 
solutions or suspensions, however, solid forms suitable for solution in, or suspension 
in, liquid prior to injection can also be prepared. The preparation can also be 

15 emulsified. The active therapeutic ingredient is often mixed witii excipients which are 
pharmaceutically acceptable and compatible witii the active ingredient. Suitable 
excipients are, for example, water, salme, dextrose, glycerol, etiianol, or the like and 
combinations thereof In addition, if desired, the composition can contain minor 
amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering 

20 agents which enhance the effectiveness of the active ingredient. 

An active component can be formulated into the therapeutic composition as 
neutralized pharmaceutically acceptable salt forms. Pharmaceutically acceptable salts 
include the acid addition salts (formed with the free amino groups of the polypeptide 

25 or antibody molecule) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, 
mandelic, and the like. Salts formed from the free carboxyl groups can also be derived 
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or 
ferric hydroxides, and such organic bases as isopropylamine, trhnethylamine, 2- 

30 ethylamino ethanol, histidine, procaine, and the like. 

A composition comprising "A" (where "A" is a single protein, DNA moleeule, vector, 
etc.) is substantially free of "B" (where "B" comprises one or more contaminating 
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proteins, DNA molecules, vectors, etc.) when at least about 75% by weight of the 
proteins, DNA, vectors (depending on the category of species to which A and B 
belong) in the composition is "A". Preferably, "A" comprises at least about 90% by 
weight of the A+B species in the composition, most preferably at least about 99% by 

5 weight. 

The phrase "therapeutically effective amount" is used herein to mean an amount 
sufficient to reduce by at least about 15 percent, preferably by at least 50 percent, 
more preferably by at least 90 percent, and most preferably prevent, a clinically 

10 significant deficit in the activity, function and response of the host. Alternatively, a 
therapeutically effective amount is sufficient to cause an improvement in a clinically 
significant condition in the host. In the context of the present invention, a deficit in the 
response of the host is evidenced by continuing or spreading bacterial infection. An 
improvement in a clinically significant condition in the host includes a decrease in 

15 bacterial load, clearance of bacteria from colonized host cells, reduction in fever or 
inflammation associated with infection, or a reduction in any symptom associated with 
the bacterial infection. 

According to the invention, the component or components of a therapeutic 
20 composition of the invention may be introduced parenterally, transmucosally, e.g., 
orally, nasally, pulmonarailly, or rectally, or transdermally. Preferably, administration 
is parenteral, e.g., via intravenous injection, and also including, but is not limited to, 
intra-arteriole, intramuscular, intradermal, subcutaneous, intraperitoneal, 
intraventricular, and intracranial administration. Oral or pulmonary delivery may be 
25 preferred to activate mucosal immunity; since Group B streptococci generally colonize 
the nasopharyngeal and pulmonary mucosa, particularly that of neonates, mucosal 
immunity may be a particularly effective preventive treatment. The term "unit dose" 
when used in reference to a therapeutic composition of the present invention refers to 
physically discrete units suitable as unitary dosage for humans, each unit containing a 
30 predetermined quantity of active material calculated to produce the desired therapeutic 
effect in association with the required diluent; i.e., carrier, or vehicle. 
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In another embodiment, the active compound can be delivered in a vesicle, in 
particular a liposome (see Langer, Science 249:1527-1533 (1990); Treat et al., in 
Liposomes in the Therapy of Infectious Disease and Cancer, Lopez-Berestein and 
Fidler (eds.), Liss, New York, pp. 353-365 (1989); Lopez-Berestein, ibid., pp. 317- 
5 327; see generally ibid). 

In yet another embodiment, the therapeutic compound can be delivered in a controlled 
release system. For example, the polypeptide may be administered using intravenous 
infusion, an implantable osmotic pump, a transdermal patch, liposomes, or other 

10 modes of administration. In one enibodiment, a pump may be used (see Langer, supra; 
Sefton, CRC Crit Ref Biomed. Eng. 14:201 (1987); Buchwald et al. Surgery 88:507 
(1980); Saudek et al, N. Engl, J. Med, 321:574 (1989)). In another embodiment, 
polymeric materials can be used (see Medical Applications of Controlled Release^ 
Langer and Wise (eds.), CRC Pres., Boca Raton, Florida (1974); Controlled Drug 

15 Bioavailability, Drug Product Design and Performance^ Smolen and Ball (eds.), 
Wiley, New York (1984); Ranger and Peppas, J. Macromol, Sci. Rev, MacromoL 
Chem. 23:61 (1983); see also Levy et al.. Science 228:190 (1985); During et aL, Ann. 
Neurol. 25:351 (1989); Howard a/., J. Neurosurg. 71:105 (1989)). In yet another 
embodiment, a controlled release system can be placed in proximity of the therapeutic 

20 target, i.e., the brain, thus requiring only a fraction of the systemic dose (see, e.g., 
Goodson, iriMedical Applications of Controlled Release, supra, vol. 2, pp. 115-138 
(1984)). Preferably, a controlled release device is introduced into a subject in 
proximity of the site of inappropriate immune activation or a tumor. Other controlled 
release systems are discussed in the review by Langer (Science 249:1527-1533 

25 (1990)). 

A subject in whom administration of an active component as set forth above is an 
effective therapeutic regimen for a bacterial infection is preferably a human, but can be 
any animal. Thus, as can be readily appreciated by one of ordinary skill in the art, the 
30 methods and pharmaceutical compositions of the present invention are particularly 

suited to administration to any animal, particularly a mammal, and including, but by no 
means limited to, domestic animals, such as feline or canine subjects, farm animals, 
such as but not limited to bovine, equine, caprine, ovine, and porcine subjects, wild 
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animals (whether in the wild or in a zoological garden), research animals, such as mice, 
rats, rabbits, goats, sheep, pigs, dogs, cats, etc., Le,, for veterinary medical use. 

In the therapeutic methods and compositions of the invention, a therapeutically 
5 effective dosage of the active component is provided. A therapeutically effective 
dosage can be determined by the ordinary skilled medical worker based on patient 
characteristics (age, weight, sex, condition, complications, other diseases, etc.), as is 
well known in the art. Furthermore, as further routine studies are conducted, more 
specific information will emerge regarding appropriate dosage levels for treatment of 

10 various conditions in various patients, and the ordinary skilled worker, considering the 
therapeutic context, age and general health of the recipient, is able to ascertain proper 
dosing. Generally, for intravenous injection or infusion, dosage may be lower than for 
intraperitoneal, intramuscular, or other route of administration. The dosing schedule 
may vary, depending on the circulation half-life, and the formulation used. The 

15 compositions are administered in a manner compatible with the dosage formulation in 
the therapeutically effective amount. Precise amounts of active ingredient required to 
be administered depend on the judgment of the practitioner and are peculiar to each 
individual. However, suitable dosages may range from about 0. 1 to 20, preferably 
about 0.5 to about 10, and more preferably one to several, milligrams of active 

20 ingredient per kilogram body weight of individual per day and depend on the route of 
administration. Suitable regimes for initial administration and booster shots are also 
variable, but are typified by an initial administration followed by repeated doses at one 
or more hour intervals by a subsequent injection or other administration. 
Alternatively, continuous intravenous infusion sufficient to maintain concentrations of 

25 ten nanomolar to ten micromolar in the blood are contemplated. 

Administration with other compounds. For treatment of a bacterial infection, one may 
administer the present active component in conjunction with one or more 
pharmaceutical compositions used for treating bacterial infection, including but not 
30 limited to (1) antibiotics; (2) soluble carbohydrate inhibitors of bacterial adhesin; (3) 
other small molecule inhibitors of bacterial adhesin; (4) inhibitors of bacterial 
metabolism, transport, or transformation; (5) stimulators of bacterial lysis,'Or (6) anti- 
bacterial antibodies or vaccines directed at other bacterial antigens. Other potential 
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active components include anti-inflammatory agents, such as steroids and non- 
steroidal anti-inflammatory drugs. Administration may be simultaneous (for example, 
administration of a mixture of the present active component and an antibiotic), or may 
be in seriatim, 

5 

Accordingly, in specific embodiment, the therapeutic compositions may fiirther include 
an effective amount of the active component, and one or more of the following active 
ingredients: an antibiotic, a steroid, etc. 

10 Thus, in a specific instance where it is desired to reduce or inhibit the infection 

resulting from a bacterium mediated binding of bacteria to a host cell, or an antibody 
thereto, or a ligand thereof or an antibody to that ligand, the polypeptide is introduced 
to block the interaction of the bacteria with the host cell 

15 Also contemplated herein is pulmonary delivery of an inhibitor of the polypeptide of 
the present invention having which acts as adhesin inhibitory agent (or derivatives 
thereof). The adhesin inhibitory agent (or derivative) is delivered to the lungs of a 
mammal, where it can interfere with bacterial, /.e., streptococcal, and preferably Group 
B streptococcal binding to host cells. Other reports of preparation of proteins for 

20 pulmonary delivery are found in the art [Adjei et a/. (1990) Pharmaceutical Research, 
7:565-569; Adjei et a/.(1990) InternationalJournal of Pharmaceutics^ 63:135-144 
(leuprolide acetate); Braquet et al (1989), Journal of Cardiovascular Pharmacology, 
13(suppl. 5): 143-146 (endothelin-1); Hubbard et a/. (1989) Annals of Internal 
Medicine, Vol. DI, pp. 206-212 (a 1 -antitrypsin); Smithed a/. (1989) J. Clin, Invest 

25 84: 1 145-1 146 (a- 1 -proteinase); Oswein et al., "Aerosolization of Proteins", 

Proceedings of Symposium on Respiratory Drug Delivery II, Keystone, Colorado, 
March, (1990) (recombinant human growth hormone); Debs et a/.(1988) J. Immunol. 
140:3482-3488 (interferon-y and tumor necrosis factor alpha); Platz et al, U.S. Patent 
No. 5,284,656 (granulocyte colony stimulating factor)]. A method and composition 

30 for pulmonary delivery of drugs is described in U.S. Patent No. 5,451,569, issued 
September 19, 1995 to Wong et al. 
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All such devices require the use of formulations suitable for the dispensing of adhesin 
inhibitory agent (or derivative). Typically, each formulation is specific to the type of 
device employed and may involve the use of an appropriate propellant material, in 
addition to the usual diluents, adjuvant and/or carriers usefiil in therapy. Also, the use 
5 of liposomes, microcapsules or microspheres, inclusion complexes, or other types of 
carriers is contemplated. Chemically modified adhesin inhibitory agent may also be 
prepared in different formulations depending on the type of chemical modification or 
the type of device employed. 

10 Formulations suitable for use with a.nebulizer, either jet or ultrasonic, will typically 
comprise adhesin inhibitory agent (or derivative) dissolved in water at a concentration 
of about 0. 1 to 25 mg of biologically active adhesin inhibitory agent per ml of solution. 
The formulation may also include a buffer and a simple sugar (e.g., for adhesin 
inhibitory agent stabilization and regulation of osmotic pressure). The nebulizer 

15 formulation may also contain a surfactant, to reduce or prevent surface induced 

aggregation of the adhesin inhibitory agent caused by atomization of the solution in 
forming the aerosol. 

Formulations for use with a metered-dose inhaler device will generally comprise a 
20 finely divided powder containing the adhesin inhibitory agent (or derivative) suspended 
in a propellant with the aid of a surfactant. The propellant may be any conventional 
material employed for this purpose, such as a chlorofluoro carbon, a 
hydrochlorofluorocarbon, a hydrofluorocarbon, or a hydrocarbon, including 
trichlorofluoromethane, dichlorodifluoromethane, dichlorotetrafluoroethanol, and 
25 1,1,1 ,2-tetrafluoroethane, or combinations thereof Suitable surfactants include 
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a surfactant. 

The liquid aerosol formulations contain adhesin inhibitory agent and a dispersing agent 
in a physiologically acceptable diluent. The dry powder aerosol formulations of the 
30 present invention consist of a finely divided solid form of adhesin inhibitory agent and 
a dispersing agent. With either the liquid or dry powder aerosol formulation, the 
formulation must be aerosolized. That is, it must be broken down into liquid or solid 
particles in order to ensure that the aerosolized dose actually reaches the mucous 
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membranes of the nasal passages or the lung. The term "aerosol particle" is used 
herein to describe the liquid or solid particle suitable for nasal or pulmonary 
administration, /.e., that will reach the mucous membranes. Other considerations, such 
as construction of the delivery device, additional components in the foramlation, and 
5 particle characteristics are important. These aspects of pulmonary administration of a 
drug are well known in the art, and manipulation of formulations, aerosolization means 
and construction of a delivery device require at most routine experimentation by one 
of ordinary skill in the art. In a particular embodiment, the mass median dynamic 
diameter will be 5 micrometers or less in order to ensure that the drug particles reach 
10 the lung alveoli [Wearley, L.L. (1991) Crit Rev. in Ther, Drug Carrier Systems 
8:333]. 

Systems of aerosol delivery, such as the pressurized metered dose inhaler and the dry 
powder inhaler are disclosed in Newman, S.P., Aerosols and the Lung, Clarke, S.W. 
15 and Davia, D. editors, pp. 197-22 and can be used in connection with the present 
invention. 

In a further embodiment, as discussed in detail infra, an aerosol formulation of the 
present invention can include other therapeutically or pharmacologically active 
20 ingredients in addition to adhesin inhibitory agent, such as but not limited to an 
antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc. 

Liquid Aerosol Formulations, The present invention provides aerosol formulations 
and dosage forms for use in treating subjects suffering from bacterial, e.g, 

25 streptococcal, in particularly streptococcal, infection. In general such dosage forms 
contain adhesin inhibitory agent in a pharmaceutically acceptable diluent. 
Pharmaceutically acceptable diluents include but are not limited to sterile water, saline, 
buffered saline, dextrose solution, and the like. In a specific embodiment, a diluent 
that may be used in the present invention or the pharmaceutical formulation of the 

30 present invention is phosphate buffered saline, or a buffered saline solution generally 
between the pH 7.0-8.0 range, or water. 
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The liquid aerosol formulation of the present invention may include, as optional 
ingredients, pharmaceutically acceptable carriers, diluents, solubilizing or emulsifying 
agents, surfactants and excipients. The formulation may include a carrier. The carrier 
is a macromolecule which is soluble in the circulatory system and which is 
5 physiologically acceptable where physiological acceptance means that those of skill in 
the art would accept injection of said carrier into a patient as part of a therapeutic 
regime. The carrier preferably is relatively stable in the circulatory system with an 
acceptable plasma half life for clearance. Such macromolecules include but are not 
limited to Soya lecithin, oleic acid and sorbitan trioleate, with sorbitan trioleate 
10 preferred. 

The formulations of the present embodiment may also include other agents useful for 
pH maintenance, solution stabilization, or for the regulation of osmotic pressure. 
Examples of the agents include but are not limited to salts, such as sodium chloride, or 
15 potassium chloride, and carbohydrates, such as glucose, galactose or mannose, and the 
like. 

The present invention further contemplates liquid aerosol formulations comprising 
adhesin inhibitory agent and another therapeutically effective drug, such as an 
20 antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc. 

Aerosol Dry Powder Formulations. It is also contemplated that the present aerosol 
formulation can be prepared as a dry powder formulation comprising a finely divided 
powder form of adhesin inhibitory agent and a dispersant. 

25 Formulations for dispensing from a powder inhaler device will comprise a finely 

divided dry powder containing adhesin inhibitory agent (or derivative) and may also 
include a bulking agent, such as lactose, sorbitol, sucrose, or mannitol in amounts 
which facilitate dispersal of the powder firom the device, e.g., 50 to 90% by weight of 
the formulation. The adhesin inhibitory agent (or derivative) should most 

30 advantageously be prepared in particulate form with an average particle size of less 
than 10 mm (or microns), most preferably 0.5 to 5 mm, for most effective delivery to 
the distal lung. In another embodiment, the dry powder formulation can comprise a 
finely divided dry powder containing adhesin inhibitory agent, a dispersing agent and 
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also a bulking agent. Bulking agents useful in conjunction with the present 
formulation include such agents as lactose, sorbitol, sucrose, or mannitol, in amounts 
that facilitate the dispersal of the powder from the device. 

5 The present invention fiirther contemplates dry powder formulations comprising 
adhesin inhibitory agent and another therapeutically effective drug, such as an 
antibiotic, a steroid, a non-steroidal anti-inflammatory drug, etc. 

Contemplated for use herein are oral solid dosage forms, which are described generally 
10 in Remington's Pharmaceutical Sciences, 1 8th Ed. 1990 (Mack PubUshing Co. Easton 
PA 1 8042) at Chapter 89, which is herein incorporated by reference. Solid dosage 
forms include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, 
liposomal or proteinoid encapsulation may be used to formulate the present 
compositions (as, for example, proteinoid microspheres reported in U.S. Patent 
15 No. 4,925,673). Liposomal encapsulation may be used and the liposomes may be 

derivatized with various polymers (e.g., U.S. Patent No. 5,013,556). A description of 
possible solid dosage forms for the therapeutic is given by Marshall, K. In: Modern 
Pharmaceutics BditQd by G.S. Banker and C.T. Rhodes Chapter 10, 1979, herein 
incorporated by reference. In general, the formulation will include the component or 
20 components (or chemically modified forms thereof) and inert ingredients which allow 
for protection against the stomach environment, and release of the biologically active 
material in the intestine. 

Also specifically contemplated are oral dosage forms of the above derivatized 
25 component or components. The component or components may be chemically 

modified so that oral delivery of the derivative is efficacious. Generally, the chemical 
modification contemplated is the attachment of at least one moiety to the component 
molecule itself, where said moiety permits (a) inliibition of proteolysis; and (b) uptake 
into the blood stream from the stomach or intestine. Also desired is the increase in 
30 overall stability of the component or components and increase in circulation time in the 
body. Examples of such moieties include: polyethylene glycol, copolymers of 
ethylene glycol and propylene glycol, carbo^methyl cellulose, dextran, pglyvinyl 
alcohol, polyvinyl pyrroUdone and polyproline. Abuchowski and Davis, 1981, 
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"Soluble Polymer-Enzyme Abducts" In: Enzymes as Drugs^ Hocenberg and Roberts, 
eds., Wiley-Interscience, New York, NY, pp. 367-383; Newmark, et al (1982) J, 
AppL Biochem, 4:185-189. Other polymers that could be used are poly-l,3-dioxolane 
and poly-l,3,6-tioxocane. Preferred for pharmaceutical usage, as indicated above, are 
5 polyethylene glycol moieties. 

For the component (or derivative) the location of release may be the stomach, the 
small intestine (the duodenum, the jejunem, or the ileum), or the large intestine. One 
skilled in the art has available formulations which will not dissolve in the stomach, yet 
10 will release the material in the duodenum or elsewhere in the Intestine. Preferably, the 
release will avoid the deleterious effects of the stomach environment, either by 
protection of the protein (or derivative) or by release of the biologically active material 
beyond the stomach environment, such as in the intestine. 

15 To ensure full gastric resistance a coating impermeable to at least pH 5.0 is essential. 
Examples of the more common inert ingredients that are used as enteric coatings are 
cellulose acetate trimellitate (CAT), hydroxypropylmethylcellulose phthalate 
(HPMCP), HPMCP 50, HPMCP 55, polyvinyl acetate phthalate (PVAP), Eudragit 
L30D, Aquateric, cellulose acetate phthalate (CAP), Eudragit L, Eudragit S, and 

20 Shellac. These coatings may be used as mixed films, 

A coating or mixture of coatings can also be used on tablets, which are not intended 
for protection against the stomach. This can include sugar coatings, or coatings which 
make the tablet easier to swallow. Capsules may consist of a hard shell (such as 
25 gelatin) for delivery of dry therapeutic i.e. powder; for liquid forms, a soft gelatin shell 
may be used. The shell material of cachets could be thick starch or other edible paper. 
For pills, lozenges, molded tablets or tablet triturates, moist massing techniques can be 
used. 

30 The peptide therapeutic can be included in the formulation as fine multiparticulates in 
the form of granules or pellets of particle size about 1mm. The formulation of the 
material for capsule administration could also be as a powder, lightly compressed 
plugs or even as tablets. The therapeutic could be prepared by compression. 



wo 02/12294 



PCT/USOl/24795 



88 

Colorants and flavoring agents may all be included. For example, the protein (or 
derivative) may be formulated (such as by liposome or microsphere encapsulation) and 
then further contained within an edible product, such as a refrigerated beverage 
containing colorants and flavoring agents, 

5 

One may dilute or increase the volume of the therapeutic with an inert material. These 
diluents could include carbohydrates, especially mannitol, a-lactose, anhydrous lactose, 
cellulose, sucrose, modified dextran and starch. Certain inorganic salts may be also be 
used as fillers including calcium triphosphate, magnesium carbonate and sodium 
10 chloride. Some commercially availa|)le diluents are Fast-Flo, Emdex, STA-Rx 1500, 
Emcompress and Avicell. 

Disintegrants may be included in the formulation of the therapeutic into a solid dosage 
form. Materials used as disintegrates include but are not limited to starch, including 

15 the commercial disintegrant based on starch, Explotab. Sodium starch glycolate, 
Amberlite, sodium carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, 
orange peel, acid carboxymethyl cellulose, natural sponge and bentonite may all be 
used. Another form of the disintegrants are the insoluble cationic exchange resins. 
Powdered gums may be used as disintegrants and as binders and these can include 

20 powdered gums such as agar, Karaya or tragacanth. Alginic acid and its sodium salt 
are also useful as disintegrants. Binders may be used to hold the therapeutic agent 
together to form a hard tablet and include materials from natural products such as 
acacia, tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl 
cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone (PVP) and 

25 hydroxypropylmethyl cellulose (HPMC) could both be used in alcoholic solutions to 
granulate the therapeutic. 

An antifrictional agent may be included in the formulation of the therapeutic to prevent 
sticking during the formulation process. Lubricants may be used as a layer between 
30 the therapeutic and the die wall, and these can include but are not limited to; stearic 
acid including its magnesium and calcium salts, polytetrafluoroethylene (PTFE), liquid 
paraffin, vegetable oils and waxes. Soluble lubricants may also be used such as sodium 
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lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of various molecular 
weights, Carbowax 4000 and 6000. 

Glidants that might improve the flow properties of the drug during formulation and to 
5 aid rearrangement during compression might be added. The glidants may include 
starch, talc, pyrogenic silica and hydrated silicoaluminate. 

To aid dissolution of the therapeutic into the aqueous environment a surfactant might 
be added as a wetting agent. Surfactants may include anionic detergents such as 

10 sodium lauryl sulfate, dioctyl sodiunj sulfosuccinate and dioctyl sodium sulfonate. 
Cationic detergents might be used and could include benzalkonium chloride or 
benzethomium chloride. The list of potential nonionic detergents that could be 
included in the formulation as surfactants are lauromacrogol 400, polyoxyl 40 stearate, 
polyoxy ethylene hydrogenated castor oil 10, 50 and 60, glycerol monostearate, 

15 polysorbate 40, 60, 65 and 80, sucrose fatty acid ester, methyl cellulose and 

carboxymethyl cellulose. These surfactants could be present in the formulation of the 
protein or derivative either alone or as a naxture in different ratios. 

Additives which potentially enhance uptake of the polypeptide (or derivative) are for 
20 instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Pulmonary Delivery, Also contemplated herein is pulmonary delivery of the present 
polypeptide (or derivatives thereof). The polypeptide (or derivative) is delivered to 
the lungs of a mammal while inhaling and coats the mucosal surface of the alveoli. 

25 Other reports of this include Adjei et al (1990) Pharmaceutical Research 7:565-569; 
Adjei et al (1990) International Journal of Pharmaceutics 63 : 135-144 (leuprolide 
acetate); Braquet et al (1989) Journal of Cardiovascular Pharmacology, 
13(suppl. 5): 143-146 (endothelin-1); Hubbard a/. {19B9) Annals of Internal 
Medicineyol lU, pp. 206-212 (al- antitrypsin); Smithed a/. (1989) J. Clin. Invest. 

30 84: 1 145-1 146 (a- 1 -proteinase); Oswein et al (1990) "Aerosolization of Proteins", 
Proceedings of Symposium on Respiratory Drug Delivery 11^ Keystone, Colorado, 
March, (recombinant human growth hormone); Debs et al (1988) J. Immunol 
140:3482-3488 (interferon-g and tumor necrosis factor alpha) and Platz et al., U.S. 
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Patent No. 5,284,656 (granulocyte colony stimulating factor). A method and 
composition for pulmonary delivery of drugs for systemic effect is described in U.S. 
Patent No. 5,451,569, issued September 19, 1995 to Wong et al. 

5 Contemplated for use in the practice of this invention are a wide range of mechanical 
devices designed for pulmonary delivery of therapeutic products, including but not 
limited to nebulizers, metered dose inhalers, and powder inhalers, all of which are 
familiar to those skilled in the art. 

10 Formulations suitable for use with a nebulizer, either jet or ultrasonic, will typically 
comprise polypeptide (or derivative) dissolved in water at a concentration of about 
0. 1 to 25 mg of biologically active protein per mL of solution. The formulation may 
also include a buffer and a simple sugar (e.g,, for protein stabilization and regulation of 
osmotic pressure). The nebulizer formulation may also contain a surfactant, to reduce 

15 or prevent surface induced aggregation of the protein caused by atomization of the 
solution in forming the aerosol. 

Formulations for use with a metered-dose inhaler device will generally comprise a 
finely divided powder containing the polypeptide (or derivative) suspended in a 

20 propellant with the aid of a surfactant. The propellant may be any conventional 
material employed for this purpose, such as a chlorofluorocarbon, a 
hydrochlorofluorocarbon, a hydrofluorocarbon, or a hydrocarbon, including 
trichlorofluoromethane, dichlorodifluoromethane, dichlorotetrafluoroethanol, and 
1, 1, 1,2-tetrafluoroethane, or combinations thereof. Suitable surfactants include 

25 sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a surfactant. 

Formulations for dispensing from a powder inhaler device will comprise a finely 
divided dry powder containing polypeptide (or derivative) and may also include a 
bulking agent, such as lactose, sorbitol, sucrose, or mannitol in amounts which 
30 facilitate dispersal of the powder from the device, e.g., 50 to 90% by weight of the 
formulation. The protein (or derivative) should most advantageously be prepared in 
particulate form with an average particle size of less than 10 mm (or microns), most 
preferably 0.5 to 5 mm, for most effective delivery to the distal lung. 
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Nasal Delivery. Nasal or nasopharyngeal delivery of the polypeptide (or derivative) is 
also contemplated. Nasal delivery allows the passage of the polypeptide directly over 
the upper respiratory tract mucosal after administering the therapeutic product to the 
nose, without the necessity for deposition of the product in the lung. Formulations for 
5 nasal delivery include those with dextran or cyclodextran. 

tide nomenclature, J, Biol. Chem., 243:3552-59 (1969), abbreviations for amino acid 

The therapeutic polypeptide-, analog- or active fragment-containing compositions are 
conventionally administered intravenously, as by injection of a unit dose, for example. 
10 The term "unit dose" when used in reference to a therapeutic composition of the 
present invention refers to physically discrete units suitable as unitary dosage for 
humans, each unit containing a predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with the required diluent; i.e., 
carrier, or vehicle. 

15 

The compositions are administered in a manner compatible with the dosage 
formulation, and in a therapeutically effective amount. The quantity to be 
administered depends on the subject to be treated, capacity of the subject's immune 
system to utilize the active ingredient, and degree of inhibition or neutralization of 

20 binding capacity desired. Precise amounts of active ingredient required to be 
administered depend on the judgment of the practitioner and are peculiar to each 
individual. However, suitable dosages may range from about 0. 1 to 20, preferably 
about 0.5 to about 10, and more preferably one to several, milligrams of active 
ingredient per kilogram body weight of individual per day and depend on the route of 

25 administration. Suitable regimes for initial administration and booster shots are also 
variable, but are typified by an initial administration followed by repeated doses at one 
or more hour intervals by a subsequent injection or other administration. 
Alternatively, continuous intravenous infusion sufficient to maintain concentrations of 
ten nanomolar to ten micromolar in the blood are contemplated. 

30 

The invention may be better understood by reference to the following non-limiting 
Examples, which are provided as exemplary of the invention. The following examples 
are presented in order to more fully illustrate the preferred embodiments of the 
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invention and should in no way be construed, however, as limiting the broad scope of 
the invention. 

EXAMPLE 1 

5 

IDENTIFICATION OF GROUP B STREPTOCOCCUS GENES 

Comparing the genetic and phenotypic composition of genetically-related groups of a 
bacterial species facilitates identifying virulence factors present in the most pathogenic 

10 groups. Type III GBS can be subdiyided into three groups of related strains based on 
the analysis of restriction digest patterns (RDPs) produced by digestion of 
chromosomal DNA with Hind III and Sse 8387 (5, 6). Over 90% of invasive type III 
GBS disease in neonates in Japan and in Salt Lake City is caused by bacteria from one 
of three RDP types, termed RDP type III-3, while RDP type III-2 are significantly 

15 more likely to be isolated from vagina than from blood or CSF (6), These results 
suggest that this genetically-related cluster of type III-3 GBS are more virulent than 
III-2 strains and could be responsible for the majority of invasive type III disease 
globally. We proposed that bacterial factors that contribute to the increased virulence 
of III-3 strains can be identified by characterizing the differences between the genetic 

20 composition of ni-3 and III-2 strains. Such genetic differences will be found in the 
bacterial chromosomes since these strains do not contain plasmids (6). 

To identify genes present in virulent type III-3 GBS strains and not in the avirulent 
type III-2 strains we used a modification of the technique described by Lisitsyn et al 

25 (7). High molecular weight genomic DNA from an invasive RDP type 111-3 GBS strain 
(strain 874391) and a colonizing ("avirulent") RDP type III-2 strain (strain 865043) 
was prepared by cell lysis with mutanolysin and Proteinase K digestion (5). For 
genetic subtraction, genomic DNA firom both strains was digested withTaq L Taq 
I-digested DNA fi"om the virulent strain was mixed with two complementary 

30 oligonucleotides (TaqA (5'-CTAGGTGGATCCTTCGGCAAT-3' (SEQ ID NO: 1 1)) 
and TaqB (5'-CGATTGCCGA-3' (SEQ ID NO: 12)), heated to 50°C for 5 minutes, 
then allowed to cool slowly to 16°C in T4 ligase buffer. Oligonucleotides were ligated 
to the virulent strain DNA by incubation with 20 units of T4 ligase at 16°C for 12 
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hours. After ligation, 500 ng of DNA from the virulent strain, with ligated linkers, and 
40 ug of DNA from the avirulent strain, without linkers, was mixed together, 
denatured by heating, and hybridized at 68°C for 20 hours. 

5 Ten percent of the resulting hybridization mixture was incubated with Taq DNA 
polymerase and dNTPs to fill in the ends of annealed virulent strain DNA. The 
hybridized DNA was amplified by Taq DNA polymerase for 10 cycles using the TaqA 
oligonucleotide as the forward and reverse amplification primer. After amplification, 
single stranded products remaining after amplification were digested with mung bean 
10 nuclease. Twenty percent of the resulting product was then reamplified for 20 cycles. 
This process of subtraction followed by PGR amplification results in enhanced 
amplification of DNA segments from the III-3 strains that do not hybridize with DNA 
segments firom the III-2 strains. 

15 A total of four cycles of subtraction and amplification were carried out, using 

successively smaller quantities of III-3 specific PGR products and alternating two sets 
of adaptors (TaqA/B (SEQ IDNOS: 11 and 12, respectively) and TaqE/F (TaqE (5*- 
AGGGAACTGTGGTAAGCGAGGGAAT-3' (SEQ ID NO: 13)); and TaqF (5'- 
GGATTGCGTGG-3' (SEQ ID NO: 14)). The final amplification products were 

20 ligated into pBS KS+ vectors. Thirteen clones were randomly selected for analysis. 
These probes were used in slot and dot blot experiments to determine whether 
subtraction was successfijl and tp identify probes hybridizing with all III-3 strains. 
Each of the 6 unique probes hybridized with the parental III-3 virulent strain, while 
none of the probes hybridized with the avirulent III-2 strains. Two of the amplified 

25 sequence tags (clones DYl-1 and DYl-1 1) hybridized with genomic DNA firom all 62 
type III isolates, but did not hybridize with DNA prepared from the III-2 and III-l 
isolates (FIGURE 1). To obtain additional sequence information, we constructed a 
genomic GBS III~3 library. Multiple plaques hybridizing with each of the III-3 
GBS-specific probes have been purified for fiirther characterization. 

30 

RESULTS 

• 

THE .vflfe LOCUS 



wo 02/12294 



PCT/USOl/24795 



94 

Three overlapping genomic clones hybridizing with probe DYl-1 were identified. A 
6.4 kb Sal I-Bgl II fragment present in each clone was subcloned and sequenced. This 
genomic DNA is present in all RDP type III-3 strains but not in serotype III-2, III-l or 
other GBS serotype strains. 

5 

Over 90% of this genomic DNA fragment has been sequenced and found to contain 5 
open reading frames (ORFs). Two ORFs appear to be candidates for virulence genes, 
spbl is a 1509 bp ORF. The predicted protein (502 amino acids and Mr 53,446) has 
the characteristics of a cell-wall bound protein. The nucleic acid and predicted amino 

10 acid sequences of sbpl are provided^ in SEQ ID NOS: 15 and 16, respectively. The 
N-terminus of the predicted protein is a hydrophilic, basic stretch of 6 amino acids 
followed by a 23 amino acid hydrophobic, proline-rich core, consistent with a signal 
peptide. The hydrophilic mature protein terminates in a typical LPXTG (SEQ ID NO: 
17) domain that immediately precedes a hydrophobic 20 amino acid core and a short, 

15 basic hydrophilic terminus. The nucleotide sequence is not homologous to sequences 
of other known bacterial genes. The translated amino acid sequence, however, shares 
segmental homology with a number of characterized proteins, including the fimbria! 
type 2 protein of Actinomyces naeshindii (27% identity over 350 amino acids) and the 
fimbrial type 1 protein of Actinomyces viscosus (25% homology over 420 amino 

20 acids) (16), the T6 surface protein of S. pyogenes (23% identity over 359 amino 
acids) (20), and the hsf (27% identity over 260 amino acids) and HMWl adhesins 
(25% identity over 285 amino acids) of Haemophilus influenzae (21, 22). The 
fiinction of the S. pyogenes T6 protein is unknown. Each of the other homologs plays 
a role in bacterial adhesion and/or invasion. 

25 

A spbr isogenic deletion mutant GBS strain was created by homologous 
recombination (using the method as described in Example 2 below) and the ability of 
the spbr mutant to adhere to and invade A549 respiratory epithelial cells was 
determined. Compared to the wild type strain, the number of spbl' bacteria adherent 
30 to A549 monolayers was reduced by 60.0% (p<0.01) and the number of intracellular 
invading bacteria was reduced by 53.6% (p<0.01). This data suggests spbl may 
contribute to the pathogenesis of GBS pneumonia and bacterial entry into the 
bloodstream. 
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The second ORF, spb2, terminates 37 bp upstream from spbl and is in the same 
transcriptional orientation. This 1692 bp ORF has a deduced amino acid sequence of 
579 residues and Mr 64,492, The nucleic acid and predicted amino acid sequences of 
sbp2 are provided in SEQ ID NOS: 18 and 19, respectively. spb2 shares 50.5% 
5 nucleic acid identity and 20.7% amino acid identity with spbL Conservation is 
highest in the carboxy-terminal regions, including a shared LPSTGG (SEQ ID NO: 
20) motif. In contrast to spbl, spb2 does not have a obvious signal sequence. Its 
secretion may be mediated by carboxy-terminal recognition sequences or by accessory 
peptides (23). The deduced amino acid sequence of Spb2 is also homolojgous with S. 
10 pyogenes T6 and Actinomyces naeslundii proteins, and to Listeria monocytogenes 
internalin A (22% identity over 308 amino acids); again, proteins important in 
adhesion and invasion (24). 

THE LOCUS 

15 

Two genonfiic clones hybridizing with probe DYl-1 1 were identified. A 7 kb Hind III 
fragment present in each clone was subcloned and sequenced. Unlike the serotype III 
specific spb sequences, this genomic DNA, which is adjacent to a region of serotype 
III-3 specific DNA, was found to be present in all GBS tested to date, including 
20 serotype la, lb, II and V strains. This region of the GBS chromosome, which we have 
designated the extracellular matrix adhesin (ema) locus, contains 5 significant ORFs. 

emoA 

The first ORF, emoA, is 738 bp long, with a predicted protein product of 246 amino 
25 acids and Mr 26.2. The nucleic acid sequence (SEQ ID NO: 1) and predicted amino 
acid sequence (SEQ ID NO: 2) ofemoA are shown in FIGURE 2. The EmaA protein 
is a non-repetitive protein. The 27 amino acid N-terminus of the predicted protein is 
consistent with a signal peptide. The mature protein has an imperfect cell wall binding 
domain (XPXTGG (SEQ ID N0:21)) followed by a transmembrane spanning domain 
30 encompassing residues 219-235 and a terminal hydrophilic tail. The emoA nucleotide 
sequence is not homologous to known sequences of bacterial genes. The translated 
amino acid sequence, however, shares segmental homology with a number of 
characterized proteins, including a collagen adhesin, Bbp, of Staphylococcus aureus 
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(37% identity over 103 aa) (15), a type 2 fimbrial structural subunit of Actinomyces 
naeslandii (39% homology over 1 12 aa) (16), and the FimP protein oi Actinomyces 
viscosus (28% homology over 228 aa) (17). The function of the S. pyogenes T6 
protein is unknown. The type 1 and type 2 fimbria of Actinomyces mediate bacterial 
5 adhesion to salivary glycoproteins and various host cells, contributing to the 
pathogenesis of dental caries. 

emaB 

The second ORF, emaB, begins 94 bp 3' of emoA and is in the same transcriptional 
10 orientation. The nucleic acid sequence (SEQ ID NO: 3) and predicted amino acid 

sequence (SEQ ID NO: 4) of emaB are shown in FIGURE 3. It is 924 bp long; with a 
predicted protein product of 308 amino acids and Mr 33.9. The predicted EmaB 
protein is a nonrepetitive protein. The 27 amino acids N-terminus of the predicted 
protein is consistent with a signal peptide. The mature protein has an imperfect cell 
15 wall binding domain (XPXTG) followed by a transmembrane spanning domain 

encompassing residues 279-294. The emaB nucleotide sequence is not homologous to 
known sequences of bacterial genes. The translated amino acid sequence, however, 
shares segmental homology with a number of characterized proteins, including a type 2 
fimbrial structural subunit of Actinomyces naeslandii (28% homology over 222 amino 
20 acids), the T6 protein of S. pyogenes (26% homology over 266 amino acids) (20), and 
a S. epidermidis putative cell-surface adhesin (24% identity over 197 amino acids). 
The first of these proteins mediates adhesion of 51 aureus to collagen and is postulated 
to contribute to the pathogenesis of osteomyelitis and infectious arthritis. 

25 emaC 

The third ORF, emaC, begins 2 bp 3* of emaB and is the same transcriptional 
orientation. It is 918 bp long, with a predicted protein product of 305 amino acids and 
Mr 34.5. The nucleic acid sequence (SEQ ID NO: 5) and predicted amino acid 
sequence (SEQ ID NO: 6) of emaC are depicted in FIGURE 4. The EmaC protein is 
30 a nonrepetitive protein. The 30 amino acid N-terminus of the predicted protein is 
consistent with a signal peptide. The mature protein has a transmembrane spanning 
domain emcompassing residues 265 - 281. The emaC nucleotide sequence is not 
homologous to known sequences of bacterial genes. The translated amino acid 
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sequence, however, shares segmental homology with a number of characterized 
proteins, including proteins associated with the assembly of type 2 fimbrial structural 
subunit oi Actinomyces naeslandii (38% homology over 234 amino acids) (16). 
These proteins are required for the assembly of type 2 fimbria. 

5 

« 

emaP 

The fourth ORF, emoD, is 852 bp long, overlaps emaC by 47 bp, and is in the same 
transcriptional orientation. The predicted protein product is 284 amino acids and Mr 
33.1, The nucleic acid sequence (SEQ ID NO: 7) and predicted amino acid sequence 

10 (SEQ ID NO:8) of emaD are showx\ in FIGURE 5. No indentifiable N-terminal signal 
sequence is present and potential transmembrane segments are present at positions 
19-35 and 252-280. The mature protein is not repetative and lacks a cell wall binding 
domain. The emaD nucleotide sequence is not homologous to known sequences of 
bacterifil genes. The translated amino acid sequence, shares segmented homology with 

15 the same fimbria-associated proteins of Actinomyces as does EmaC. 

emaE 

The fifth ORF, emoE, begins 42 bp 3 ' of emaD and is in the same transcriptional 
orientation. It is 2712 bp long, with a predicted protein product of 904 aa and Mr 

20 100.9. FIGURE 6 depicts the nucleic acid sequence (SEQ ID NO: 9) and predicted 
amino acid sequence (SEQ ID NO: 10) of emaE. The predicted EmaE protein is a 
nonrepetitive protein. An obvious N-terminal signal peptide is not evident but a 
putative transmembrane region is located at residues 24-40. The mature protein has an 
imperfect cell wall binding domain (XPXTGG (SEQ ID NO: 21)) followed by a 

25 transmembrane spanning domain emcompassing residues 880 - 896. The emaE 

nucleotide sequence is not homologous to known sequences of bacterial genes. The 
translated amino acid sequence, however, shares segmental homology with a number 
of characterized proteins, including the Fl and F2 fibronectin binding proteins of S. 
pyogenes (31% homology over 207 amino acids) (18, 19). These proteins mediate 

30 high affinity binding to. fibronectin, and are important in the adhesion of S. pyogenes to 
respiratory cells. 
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The similarity of the protein products of the ema locus to physiologically important 
adhesins and invasins of other bacterial species suggests that the Ema proteins have a 
role in facilitating the adhesion of GBS to extracellular matrix components and to cell 
surfaces and subsequent invasion of epithelial and endothelial cells, the initial steps in 
5 the pathogenesis of infection. 

Several lines of evidence suggest the members of the ema and the spb locus may have 
similar functions, but are likely to represent distinct classes of proteins. The ema and 
spb locus genes are each and all similar to physiologically important adhesions and 

10 invasions of the bacterial species, however, both Spbl and Spb2 have prototypical 
gram positive cell-wall binding domains, whereas the members of the ema locus have 
an unusual motif, suggesting a distinct mechanism of cell surface anchoring. Second, 
the spb locus is restricted to virulent serotype III-3 strains of GBS, whereas the ema 
locus appears to be ubiquitous in all GBS serotypes. Third, spbl and spb2 are more 

15 homologous to one another than to members of the ema locus and ema genes are 
more closely homologous to one another than to spbl and spb2, 

EXAMPLE 2 

20 BIOLOGIC CHARACTERIZATION OF NOVEL GBS GENES 

Isogenic Mutant Bacterial Strains 

To identify biologic activity of these novel GBS genes, isogenic mutant bacterial 
25 strains are created which are identical in all respects except for the presence or absence 
of a particular gene. Deletion mutants are created by allelic replacement. The relevant 
gene, with 100-300 bp of flanking sequences, is subcloned and modified by the 
deletion of an intragenic portion of the coding sequence and, in some cases, the 
insertion of a kanamycin resistance gene. The mutant gene is cloned into the suicide 
30 vector pHY304 (kindly provided by Dr. Craig Rubens), a broad host range plasmid 
containing a temperature sensitive ori, erythromycin resistance gene {erm^% and a 
pBS multiple cloning site. The pHY304 vector is a derivative of the vector pWVOl 
(Framson, P.E. et al (1997) Applied Environ Microbiology 63:3539-3547). Plasmids 
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containing mutant genes are electroporated into strain 874391 and single cross-over 
mutants are selected by antibiotic resistance at 37^C. The resulting antibiotic resistant 
colonies are subjected to a temperature shift to 30°C. Integration of the plasmid is 
unstable at this permissive temperature because there are two functional ori's on the 
5 chromosome. Excised plasmid is eliminated by growth on nonselective media for 
many generations, then colonies are screened for the presence of the mutant allele by 
erythromycin-sensitivity. Double-crossover mutants are stable and do not require 
maintenance under dmg selection. The mutant genotype is confirmed by Southern 
blotting or PCR demonstrating the appropriate deletion. The resulting mutants are 

10 screened for the presence of gene expression by Northern and Western blot analysis. 
The phenotype of the knockout mutants is then compared with that of the wild type 
strain 874391 by examining growth rate and colony morphology, and the expression of 
P -hemolysin and CAMP factor. Surface protein expression is assessed by Western 
blot, using polyclonal sera from rabbits immunized with whole, heat-killed type III 

15 GBS. 

In Vitro Models 
A. Adherence 

20 Adhesion of GBS to host cells is a prerequisite for invasive disease. Three different 
cell types have the potential to be important in this process: i) adhesion to respiratory 
epithelial cells is likely to facilitate most early onset neonatal infections, ii) adhesion to 
gastrointestinal epithelial cells has been postulated to be important in the pathogenesis 
of late onset neonatal infections, and iii) adhesion to endothelial cells is necessary for 

25 both endocarditis and other endovascular infections, and is likely to be the initial event 
in GBS meningitis. The ability of wild type and mutant strains to adhere to epithelial 
and endothelial cells is compared in adhesion assays. 

Four different cell lines are used to investigate the role of novel GBS genes in 
30 adhesion. GBS adhere to and invade A549 human alveolar epitheUal carcinoma cells 
and surface proteins appear to play an important role in this process (8). GBS binding 
to A549 cells is used as an in vitro model for respiratory colonization. GBS also 
adhere to C2BBeL, a human intestinal epithelial cell line, which is used as a model for 



wo 02/12294 



PCT/USOl/24795 



100 

gastrointestinal colonization, and to HeLa cervical epithelial cells, a model for genital 
colonization and maternal infection. For endothelial adhesion, two cell lines are 
studied: freshly isolated human umbiUcal vein endothelial (HUVE) cells; and an 
immortalized human brain microvascular endothelial cell line (BMEC). Adhesion 
5 assays are performed as described by Tamura et al (9). Cell lines are grown to 
confluence in 96-well tissue culture plates in recommended media. Monolayers are 
washed with PBS and fixed with 0.5% gluteraldehyde. Following blocking with 5% 
BSA in PBS, cells are inoculated with various inocula of GBS, centrifuged for 10 
minutes at 2000 rpm and incubated for 1 hour at 4°C. Nonadherent bacteria are 
10 removed by washing three times with 5% nonfat dry milk in PBS and bound bacteria 
are then eluted and plated quantitatively. 

B. Invasion 

GBS adhere to and invade respiratory epithelium, endothelium and BMEC (8, 10, 1 1). 
15 The ability of wild type and isogenic mutant GBS strains to invade the above 

epithelial and endothelial cells are tested as previously described (8, 10, 1 1). Assays 
that distinguish the ability of GBS to invade eukaryotic cells versus adhere to cells 
capitalize on the inability of penicillin and gentamicin to enter host cells, allowing 
quantification of intracellular bacteria after extracellular bacteria are killed. GBS are 

20 grown to the desired grov^h phase in TH broth, washed twice with PBS and 

resuspended in tissue culture media containing 10% fetal calf serum. Tissue culture 
monolayers grown to confluence in 24-well plates are inoculated with varying inocula 
of GBS, centrifuged at 800xg and incubated at 37°C in 5% CO2 for 2-6 hours. 
Extracellular bacteria are removed by washing four times with PBS. Cells are then 

25 incubated in fresh medium with 5 mg/ml penicillin and 100 mg/ml gentamicin for 2 
hours. Media is then removed, monolayers washed, and cells lysed by treatment with 
0.025% Triton X-100. Cell lysates are sonicated to disrupt bacterial chains and 
aliquots plated quantitatively. 

30 C. Antibody to GRS Protmnf; 

The ability of specific antibody to the novel GBS proteins to promote 
opsonophagocytic killing of GBS is tested (12). Rabbits are immunized with 
recombinant or purified GBS proteins produced by standard techniques. Rabbit 
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antiserum of different dilutions (ranging from 1/50 to 1/5,000) that has been 
exhaustively absorbed with the relevant isogenic mutant strain at 4°C will be incubated 
with GBS in the presence of human complement and polymorphonuclear leukocytes (3 
X 10^). Opsonophagocytic killing is expressed as the log number of CFU surviving 
5 following 1 hour of incubation subtracted from the log of the number of CFU at the 
zero time point. Killing of wild type strains is compared to that of isogenic mutants 
lacking novel GBS proteins. 

In Vivo Models 

10 

The neonatal rat has been used by numerous laboratories as a model of GBS infection 
because it closely mimics human neonatal infection (13). The contribution of novel 
genes to the pathogenesis of GBS infections is tested by comparing wild type and 
mutant in this system. Rat pups are inoculated by two routes. First, pups are 
15 inoculated intranasally to mimic the respiratory infection and sepsis typical of early 
onset GBS infection. Secondly, intraperitoneal or subcutaneous inoculation 
reproduces the high grade bacteremia associated with GBS sepsis and that precedes 
GBS meningitis (14). 

20 Rat pups are inoculated with varying doses of GBS strains and mortality is determined. 
The level of bacteremia is determined by quantitative blood cultures. Lung, liver, 
spleen and meningeal tissue are preserved for histologic examination. 

The ability of antiserum to the GBS proteins to protect neonatal rats from GBS 
25 infection is tested (13). Newborn rats (<18 hours old) receive an intraperitoneal 
injection of 0.5 ml of undiluted rabbit antiserum, followed by the intraperitoneal 
inoculation of the equivalent of one LD50 unit of GBS (usually about 5000 bacteria) 
in PBS. Mortality and morbidity are then determined. 

30 Role of Novel GBS Proteins in Vaccines 



Several surface proteins of GBS, including C and Rib are immunogenic and protective 
against GBS infection in infant rodent models (25, 26). None of these proteins are 
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present in all GBS strains (27). Furthermore, each of these proteins has a repetitive 
structure. The phenotypic variability of these repetitive proteins allows escape 
mutants expressing variant forms to evade host immune systems and may limit the 
effectiveness of these vaccines (28). It is notable that each of the predicted proteins of 
5 the spb and ema loci do not have a repetitive structure and would not have this 
disadvantage. 

The novel GBS proteins we describe here may be useful antigens for a GBS vaccine. 
The data presented herein indicates these proteins have a role in mediating adhesion to 

10 and invasion of GBS to human epithelial cells, thus antibody against these antigens 
may prevent these initial steps in infection. It is highly desirable to develop a vaccifie 
that prevents colonization of pregnant women and other individuals at increased risk 
of invasive GBS infection, as this would eliminate most infections. Our data suggests 
that antibody against Spbl is effective in reducing colonization or infection following 

15 colonization with highly virulent strains of serotype III, and therefore this protein is a 
particularly useful vaccine antigen. Members of the ema locus, unlike spbl and spb2, 
are ubiquitous in GBS and therefore have a role in the prevention of infection by 
multiple serotypes of GBS. An optimal vaccine formulation includes combinations of 
these antigens. 

20 

Two strategies are used to design GBS vaccines using these novel proteins. First, 
purified recombinant or affinity-purified proteins are used as vaccine antigens, singly 
or in combination (25). Second, these proteins are used as carrier proteins for 
capsular polysaccharide or oligosaccharide-based vaccines. GBS polysaccharides and 

25 oligosaccharides are generally poorly immunogenic and fail to elicit significant memory 
and booster responses (29). Conjugation of these polysaccharides or oligosaccharides 
to protein carriers increases immunogenicity. GBS polysaccharide conjugated to 
tetanus toxoid, for example, has been used to immunize pregnant women and results in 
high levels of maternal serum anti-polysaccharide antibody which may be transferred 

30 to the fetus in the third trimester of pregnancy (30). Selection of appropriate carrier 
proteins is important for the development of polysaccharide-protein vaccine 
formulations. For example, Haemophilus influenzae type b poly- or oligosaccharide 
conjugated to different protein carriers has variable immunogenicity and elicits 
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antibody with varying avidity (31,32). Repeated immunization with the same carrier 
protein may also suppress immune responses by competition for specific B cells 
(epitopic suppression) or other mechanisms. This is of particular concern for the 
development of GBS vaccines since recently developed polyaccharide and 
5 oligosaccharide-protein conjugate vaccines against the bacteria H, influenzae, S. 
pneumoniae, and K meningitidis all utilize a restricted number of carrier proteins 
(tetanus toxoid, CRM 197, diptheria toxoid), increasing the number of exposures to 
these carriers an individual is likely to recieve. A "designer" vaccine, composed of a 
GBS polysaccharide or oligosaccharide coupled to a GBS-specific carrier protein, 
10 such as the novel GBS polypeptides. provided herein, particularly including Spbl, 
EmaC and EmaE, may be a preferable strategy. The large size of certain of these 
novel GBS antigens may also be an advantage to traditional carrier proteins as 
increasing size is associated with improved immunogenicity. 

15 EXAMPLE 3 

EMA HOMOLOGS IN STREPTOCOCCI AND OTHER BACTERIA 

As noted above, the GBS Ema proteins share segmental homology with certain 
characterized proteins from other bacterial species, including bacterial adhesion and 

20 invasion proteins. The segmental homolog is noted as in the range of 24-39%, In 
addition, the Ema proteins demonstrate some homology to one another. A 
comparison of the ema genes shows that EmaA and EmaB are 47% homologous, 
however, due to the difference in their predicted lengths it is necessary to insert gaps 
in the EmaA sequence in order to line them up. The two Ema proteins which are most 

25 similar in structure, EmaC and EmaD share 48.7% amino acid homology to one 

another. EmaA/B, EmaC/D and EmaE are each <. 20% homologous to one another. 

The ema sequences were used to search the unannotated microbial genomes 
(Eubacteria). The predicted Ema proteins were searched against translations in all six 
30 frames (tblast x) of finished and unfinished unannotated microbial genomes available at 
the web site of the National Center for Biotechnology Information (NCBI). 
Segmental amino acid homolog was identified. 
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EmaA has some segmental homolog with pneumoniae, E, faecctlis, B. anthracis and 
C diptheriae, Ema B has some segmental homolog with B, anthracis, EmaE has 
segmental homology to S, pyogenes and lesser homology to B, anthracis. 

5 Significant homology was identified between the GBS EmaC and EmaD and proteins 
in other bacterial species. EmaC has significant (55% identity over 149 amino acids) 
homology to a region of the S. pneumoniae chromosome and the 51 pyogenes 
chromosome (47% identity over 150 amino acids). Lesser segmental homology was 
found to E. faecalis, S, equi, and C. diptheriae, EmaD has strong segmental 
10 homology (66% over 184 amino acids) to a region of the S. pneumoniae chromosome, 
and lesser segmental homology to C diphtheriae and S, pyogenes. 

We have identified two Ema homologs in S. pneumoniae. These pneumoniae 
homologs show homology to EmaC and EmaD and, like EmaC and EmaD, also 

15 demonstrate homology to fimbria-associated protein oi Actinomyces. The encoding 
nucleic acid and predicted amino acid sequence of the first S, pneumoniae EmaC/D 
homolog are provided in SEQ ID NOS: 24 and 23, respectively. The genome region 
nucleic acid including the first homolog encoding sequence is provided in SEQ ID NO: 
22. The nucleic acid and predicted amino acid sequence of the second S. pneumoniae 

20 EmaC/D homolog are provided in SEQ ID NOS: 27 and 26 respectively. The 
genomic region nucleic acid of this second homolog is found in SEQ ID NO: 25. 
An EmaC/D homolog has been identified in Enterococcus faecalis by search and 
analysis. The E. faecalis EmaC/D homolog predicted amino acid sequence is provided 
in SEQ ID NO: 29. The nucleic acid sequence encoding this E. faecalis Ema homolog 

25 is provided in SEQ ID NO: 30. The nucleic acid sequence of £1 faecalis which 
genomic region encodes the EmaC/D homolog is provided in SEQ ID NO: 28. 

We have also identified an EmaD homolog in Corynebacterium diptheriae. The 
predicted amino acid sequence of the C. diptheriae EmaD homolog is provided in 
30 SEQ ID NO: 32. C. diptheriae nucleic acid sequence which encodes the homolog is 
found in SEQ ID NO: 33. The corresponding genomic region sequence of C. 
diptheriae is provided in SEQ ID NO: 31. 
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A predicted EmaC/D homolog has been identified in S, pyogenes. The predicted 
partial amino acid sequence of this Ema homolog provided in SEQ ID NO: 37. 

A region of amino acids TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34) is 
5 found in GBS EmaC, GBS EmaD, in both the EmaC/D homologs of S. pneumoniae, 
and in the E. faecalis Ema homolog. A similar sequence 

TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35) is also found in the C diptheriae 
Ema homolog. The S. pyogenes predicted Ema homolog has a similar sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36) as well. 

10 
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Various references are cited throughout this Specification, each of which is 
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WHAT IS CLAIMED IS : 

1 . An isolated streptococcal polypeptide EmaA. 

2. The EmaA polypeptide of Claim 1 which comprises the amino acid sequence 
set out in SEQ ID NO: 2, and analogs, variants and immunogenic fragments 
thereof. 

3 . An isolated streptococcal polypeptide EmaB. 

4. The EmaC polypeptide of Claim 3 which comprises the amino acid sequence 
set out in SEQ ID NO: 4, and analogs, variants and immunogenic fragments 
thereof. 

5. An isolated streptococcal polypeptide EmaC. 

6. The EmaC polypeptide of Claim 5 which comprises the amino acid sequence 
set out in SEQ ID NO: 6, and analogs, variants and immunogenic fragments 
thereof. 

7. An isolated streptococcal polypeptide EmaD. 

8. The EmaD polypeptide of Claim 7 which comprises the amino acid sequence 
set out in SEQ ID NO: 8, and analogs, variants and immunogenic fragments 
thereof. 

9. An isolated streptococcal polypeptide EmaE. 

10. The EmaE polypeptide of Claim 9 which comprises the amino acid sequence 
set out in SEQ ID NO: 10, and analogs, variants and immunogenic fragments 
thereof. 
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11. The streptococcal polypeptide of aiiy of Claims 1, 3, 5, 7 or 9 labeled with a 
detectable label. 

12. A vaccine comprising one or more streptococcal polypeptides selected from 
the group of EmaA, EmaB, EmaC, EmaD and EmaE, and a pharmaceutically 
acceptable adjuvant. 

13 . The vaccine of Claim 12, further comprising an antigen selected from the 
group consisting of; 

a. the polypeptide Spbl or an immunogenic fragment thereof; 

b. the polypeptide Spb2 or an immunogenic fragment thereof; 

c. the polypeptide C protein alpha antigen or an immunogenic fragment 
thereof; 

d. the polypeptide Rib or an immunogenic fragment thereof; 

e. the polypeptide Lmb or an immunogenic fragment thereof; 

f the polypeptide C5a-ase or an immunogenic fragment thereof; 

g. Group B streptococcal polysaccharides or oligosaccharides; and 

h. any combination of one or more of the foregoing. 

14. An immunogenic composition comprising one of more streptococcal 
polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, and a pharmaceutically acceptable adjuvant. 

1 5. The immunogenic composition of Claim 14, ftirther comprising an antigen 
selected from the group consisting of: 

a. the polypeptide Spbl of an immunogenic fragment thereof; 

b. the polypeptide Spb2 or an immunogenic fragment thereof; 

c. the polypeptide C protein alpha antigen or an immunogenic fragment 
thereof; 

d. the polypeptide Rib or an immunogenic fragment thereof; 

e. the polypeptide Lmb or an immunogenic fragment thereof; 

f. the polypeptide C5a-ase or an immunogenic fragment thereof; 

g. Group B streptococcal polysaccharides or oligosaccharides; and 
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h. any combination of one or ihore of the foregoing. 

16. A pharmaceutical composition comprising one or more streptococcal 
polypeptides selected from the group of EmaA, EmaB, EmaC, EmaD and 
EmaE, and a pharmaceutically acceptable carrier. 

17. The pharmaceutical composition of Claim 16, further comprising an active 
ingredient selected from the group consisting of: 

a. Spbl or Spb2 polypeptide; 

b. C protein alpha antigen; 

c. Rib polj^eptide; 

d. Lmb polypeptide; 

e. C5a-ase polypeptide; 

f . a Group B streptococcal polysaccharide or oligosaccharide; and 

g. an anti-streptococcal vaccine. 

18. A purified antibody to a streptococcal polypeptide selected from the group of 
EmaA, EmaB, EmaC, EmaD and EmaE. 

19. A monoclonal antibody to a streptococcal polypeptide selected from the group 
of EmaA, EmaB, EmaC, EmaD and EmaE. 

20. An immortal cell line that produces a monoclonal antibody according to Claim 
19. 

2 1 . The antibody of any of Claims 1 9 or 20 labeled with a detectable label. 

22. The antibody of Claim 21 wherein the label is selected from the group 
consisting of an enzyme, a chemical which fluoresces, and a radioactive 
element. 
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23. A pharmaceutical composition comprising one or more antibodies to a 
streptococcal protein selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE, and a pharmaceutically acceptable carrier. 

24. A pharmaceutical composition comprising a combination of at least two 
antibodies to streptococcal proteins and a pharmaceutically acceptable carrier, 
wherein at least one antibody to a protein selected from the group of EmaA, 
EmaB, EmaC, EmaD and EmaE, is combined with at least one antibody to a 
protein selected from the group of Spbl and Spb2, Rib, Lmb, C5a-ase and C 
protein alpha antigen. 

25. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
1, or a fragment thereof 

26. The isolated nucleic acid of Claim 25, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO : 1 ; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridizable fragments thereof. 

27. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
3. 

28. The isolated nucleic acid of Claim 27, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 3; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 
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c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f. hybridizable fragments thereof 



An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
5. 

The isolated nucleic acid of Claim 29, wherem the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 5; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridizable fragments thereof 

An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
7. 



The isolated nucleic acid of Claim 3 1, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 7; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridizable fragments thereof 
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33. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
9. 

34. The isolated nucleic acid of Claim 33, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 9; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 

« 

by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridizable fragments thereof 

35. A vector which comprises the nucleic acid of any of Claims 25, 27, 29, 3 1 or 
33 and a promoter. 

36. The vector of Claim 35, wherein the promoter comprises a bacterial, yeast, 
insect or mammalian promoter. 

37. The vector of Claim 35, wherein the vector is a plasmid, cosmid, yeast artificial 
chromosome (YAC), bacteriophage or eukaryotic viral DNA. 

38. A host vector system for the production of a polypeptide which comprises the 
vector of Claim 35 in a suitable host cell. 

39. The host vector system of Claim 38, wherein the suitable host cell comprises a 
prokaryotic or eukaryotic cell 

40. The nucleic acid of any of Claims 25, 27, 29, 31 or 33 which is a recombinant 
DNA molecule. 
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The recombinant DNA molecule of Claim 40, wherein the DNA molecule is 
operatively linked to an expression control sequence, 

A unicellular host transformed with a recombinant DNA molecule of Claim 40. 

A nucleic acid vaccine comprising the recombinant DNA molecule of Claim 
40. 

A method for detecting the presence of a streptococcal polypeptide selected 
from the group of EmaA, EmaB, EmaC, EmaD and EmaE, wherein the 
streptococcal polypeptide is measured by: 

a. contacting a sample in which the presence or activity of a streptococcal 
polypeptide selected from the group of EmaA, EmaB, EmaC, EmaD 
and EmaE is suspected with an antibody to the said streptococcal 
polypeptide under conditions that allow binding of the streptococcal 
polypeptide to antibody to occur; and 

b. detecting whether binding has occurred between the streptococcal 
polypeptide from the sample and the antibody; 

wherein the detection of binding indicates the' presence or activity of the streptococcal 
polypeptide in the sample. 

45. A method for detecting the presence of a bacterium having a gene encoding a 
streptococcal polypeptide selected from the group of emaA, emaB, emaC, 
emaD and emaE, comprising: 

a. contacting a sample in which the presence or activity of the bacterium 
is suspected with an oligonucleotide which hybridizes to a 
streptococcal polypeptide gene selected from the group of emoA, 
emaB, emaC, emaD and emaE, under conditions that allow specific 
hybridization of the oligonucleotide to the gene to occur; and 

b. detecting whether hybridization has occurred between the 
oligonucleotide and the gene; 

wherein the detection of hybridization indicates that presence or activity of the 
bacterium in the sample. 
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46. A method for preventing infection with a bacterium that e?qpresses a 
streptococcal Ema polypeptide comprising administering an immunogenically 
effective dose of a vaccine of Claim 12 to a subject. 

47. A method for preventing infection with a bacterium that expresses a 
streptococcal Ema polypeptide comprising administering an immunogenically 
effective dose of the immunogenic composition of Claim 14 to a subject. 

48. A method for treating infection with a bacterium that expresses a streptococcal 
Ema polypeptide comprising administering a therapeutically effective dose of a 
pharmaceutical composition of Claim 16 to a subject. 

49. A method for treating infection with a bacterium that expresses a streptococcal 
Ema polypeptide comprising administering a therapeutically effective dose of a 
pharmaceutical composition of Claim 23 to a subject. 

50. A method of inducing an immune response in a subject which has been exposed 
to or infected with a streptococcal bacterium comprising administering to the 
subject an amount of the pharmaceutical composition of Claim 16, thereby 
inducing an immune response. 

51. A method for preventing infection by a streptococcal bacteriuna in a subject 
comprising administering to the subject an amount of a pharmaceutical 
composition of Claim 23 and a pharmaceutically acceptable carrier or diluent, 
thereby preventing infection by a streptococcal bacterium. 

52. An isolated streptococcal Ema polypeptide comprising the amino acid 
sequence set out in SEQ ID NO:23. 

53. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
52. 
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5 54. The isolated nucleic acid of Claim 53, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 24; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

10 c. DNA sequences capable of encoding the amino acid sequence encoded 

by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridtzable fragments thereof 

15 55. An isolated streptococcal Ema polypeptide comprising the amino acid 
sequence set out in SEQ ID NO:26. 

56. An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
55. 

57. The isolated nucleic acid of Claim 56, wherein the nucleic acid is selected from 
20 the group consisting of: 

a. the DNA sequence of SEQ ID NO: 27; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
25 by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

f hybridizable fragments thereof 

58. An isolated streptococcal Ema polypeptide comprising the amino acid 
30 sequence set out in SEQ ED NO:37. 



59. 



An isolated nucleic acid which encodes the streptococcal polypeptide of Claim 
58. 
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60. An enterococcal Ema polypeptide comprising the amino acid sequence set out 
in SEQ ID NO:29. 

35 61 . An isolated nucleic acid which encodes the enterococcal polypeptide of Claim 
60. 

62. The isolated nucleic acid of Claim 61, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 30; 
40 b. DNA sequences that hybridize to the sequence of subpart (a) under 

moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 
45 e. alleles thereof; and 

f hybridizable fragments thereof 

63. An isolated Corynebacterium Ema polypeptide comprising the amino acid 
sequence set out in SEQ ID NO: 32. 

64. An isolated nucleic acid which encodes the Corynebacterium polypeptide of 
50 Claim 63, 

65. The isolated nucleic acid of Claim 64, wherein the nucleic acid is selected from 
the group consisting of: 

a. the DNA sequence of SEQ ID NO: 33; 

b. DNA sequences that hybridize to the sequence of subpart (a) under 
55 moderate stringency hybridization conditions; 

c. DNA sequences capable of encoding the amino acid sequence encoded 
by the DNA sequences of (a) or (b); 

d. degenerate variants thereof; 

e. alleles thereof; and 

60 f hybridizable fragments thereof 
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66. An isolated bacterial polypeptide comprising the amino acid sequence 
TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34), wherein the polypeptide 
is not isolated from Actinomyces. 

67. An isolated streptococcal polypeptide comprising the amino acid sequence 
65 TLLTCTPYMINS/THRLLVR/KG (SEQ ID NO: 34). 

68. An isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGINTHRLLVTA (SEQ ID NO: 35). 

« 

69. An isolated bacterial polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). 

70 70. An isolated streptococcal polypeptide comprising the amino acid sequence 
TLVTCTPYGVNTIOILLVRG (SEQ ID NO: 36). 

7 1 . An isolated polypeptide having the amino acid sequence selected from the 
group of 

TLLTCTPYMINS/TBDEILLVR/KG (SEQ ID NO: 34), TLVTCTPYGINTHRLLVTA 
75 (SEQ ID NO: 35), and TLVTCTPYGVNTKRLLVRG (SEQ ID NO: 36). 
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Figure 1. RDP type III-3 specific probes. Dot 
blot hybridization of probe DY1-1 with genomic 
ONA isolated from type III GBS. 10 ug of genomic 
DNA from each of 62 type ill GBS strains was 
transferred to nylon membrane. Radiolabeled 
probe 1 hybridized with DNA from ail III-3 strains 
(rows A-D) including the original type III-3 strain 
(well E-1). The probe failed to hybridize with DNA 
from III-2 strains (F1- F10, G1-7) including the 
original strain used in the subtraction hybridization 
(well E 10) and III-1 strains (wells H1-3; cf. Figure 
3), The same pattern of hybridization was 
observed using probe DY1-11. 
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EjnaA 

atg acc ctt gtt aaa aat caa gat get ctt gat aaa get aet gca aat 

Met Thr Leu Val Lya Aan Gin Asp Ala Leu Asp Lys Ala Thr Ala Asn 

15 10 15 

aca gat gat gcg gca ttt ttg gaa att cca gtt gca tea act att aat 

Thr Asp Asp Ala Ala Phe Leu Glu He Pro Val Ala Ser Thr He Asn 

20 25 30 

gaa aaa gca gtt tta gga aaa gca att gaa aat act ttt gaa ctt caa 

Glu Lys Ala Val Leu Gly Lys Ala He Glu Asn Thr Phe Glu Leu Gin 
35 40 45 

tat gac cat act cct gat aaa get gac aat cca aaa cca tct aat cct 

Tyr Asp His Thr Pro Asp Lys Ala Asp Asn Pro Lys Pro Ser Aan Pro 
50 55 60 

cca aga aaa cca gaa gtt cat act ggt ggg aaa cga ttt gta aag aaa 

Pro Arg Lys Pro Glu Val His Thr Gly Gly Lys Arg Phe Val Lys Lys 

65 70 75^ 80 

gac tea aca gaa aca caa aca eta ggt ggt get gag ttt gat ttg ttg 

ASP Ser Thr Glu Thr Gin Thr Leu Gly Gly Ala Glu Phe Asp Leu Leu 

85 90 95 

get tct gat ggg aca gca gta aaa tgg aca gat get ctt att aaa gcg 335 

Ala Ser Asp Gly Thr Ala Val Lys Trp Thr Asp Ala Leu He Lys Ala 
100 105 



aat act aat aaa aac tat att get gga gaa get gtt aet ggg caa cca 
Asn Thr Asn Lys Asn Tyr He Ala Gly Glu Ala Val Thr Gly Gin Pro 
115 120 125 

ate aaa ttg aaa tea cat aca gac ggt acg ttt gag att aaa ggt ttg 
He Lys Leu Lys Ser His Thr Asp Gly Thr Phe Glu He Lys Gly Leu 
130 135 140 

get tat gca gtt gat gcg aat gca gag ggt aca gca gta act tac aaa 
Ala Tyr Ala Val Asp Ala Asn Ala Glu Gly Thr Ala Val Thr Tyr Lys 
145 150 155 160 

tta aaa gaa aca aaa gca cca gaa ggt tat gta ate cct gat aaa gaa 
Leu Lys Glu Thr Lys Ala Pro Glu Gly Tyr Val He Pro Asp Lys Glu 
165 170 175 

ate gag ttt aca gta tea caa aca tct tat aat aca aaa cca act gac 
He Glu Phe Thr Val Ser Gin Thr Ser Tyr Asn Thr Lys Pro Thr Asp 
180 185 190 

ate acg gtt gat agt get gat gca aca cct gat aca att aaa aac aac 
He Thr Val Asp Ser Ala Asp Ala Thr Pro Asp Thr He Lys Asn Asn 
195 200 205 

aaa cgt cct tea ate cct aat act ggt ggt att ggt acg get ate ttt 
Lys Arg Pro Ser He Pro Asn Thr Gly Gly He Gly Thr Ala He Phe 
210 215 220 

gtc get ate ggt get gcg gtg atg get ttt get gtt aag ggg atg aag 
Val Ala He Gly Ala Ala Val Met Ala Phe Ala Val Lys Gly Met Lys 
225 230 235 240 

cgt cgt aca aaa gat aac taa 
Arg Arg Thr Lys Asp Asn 
245 
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EmaB 

atg aaa caa aca tta aaa ctt atg ttt tct ttt ctg ttg atg tta ggg 
Met Lys Gin Thr Leu Lys Leu Met Phe Ser Phe Leu Leu Met Leu Gly 

1 5 10 15 

act atg ttt gga att age caa act gtt tta gcg caa gaa act cat cag 

Thr Met Phe Gly He Ser Gin Thr Val Leu Ala Gin Glu Thr His Gin 

20 25 30 

ttg acg att gtt cat ctt gaa gca agg gat att gat cgt cca aat cca 

Leu Thr He Val His Leu Glu Ala Arg Asp He Asp Arg Pro Asn Pro 

35 40 45 

cag ttg gag att gcc cct aaa gaa ggg act cca att gaa gga gta etc 

Gin Leu Glu He Ala Pro Lys Glu Gly Thr Pro He Glu Gly Val Leu 

50 55 60 

tat cag ttg tac caa tta aaa tea act gaa gat ggc gat ttg ttg gca 

Tyr Gin Leu Tyr -Gin Leu Lys Ser Thr Glu Asp Gly Asp Leu Leu Ala 

65 70 75 80 

cat tgg aat tec eta act ate aea gaa ttg aaa aaa cag gcg cag cag 

His Trp Asn Ser Leu Thr He Thr Glu Leu Lys Lys Gin Ala Gin Gin 

85 90 95 

gtt ttt gaa gcc act act aat caa caa gga*liag get aca ttt aae caa 

Val Phe Glu Ala Thr Thr Asn Gin Gin Gly Lys Ala Thr Phe Asn Gin 

100 105 110 

eta cca gat gga att tat tat ggt ctg gcg gtt aaa gcc ggt gaa aaa 

Leu Pro Asp Gly He Tyr Tyr Gly Leu Ala Val Lys Ala Gly Glu Lys 

lis 120 125 

aat cgt aat gtc tea get ttc ttg gtt gac ttg tct gag gat aaa gtg 

Asn Arg Asn Val Ser Ala Phe Leu Val Asp Leu Ser Glu Asp Lys Val 

130 135 140 

att tat cct aaa ate ate tgg tec aca ggt gag ttg gac ttg ctt aaa 

He Tyr Pro Lys He He Trp Ser Thr Gly Glu Leu Asp Leu Leu Lys 

145 150 155 160 

gtt ggt gtg gat ggt gat ace aaa aaa cca eta gca ggc gtt gtc ttt 

Val Gly Val Asp Gly Asp Thr Lys Lys Pro Leu Ala Gly Val Val Phe 

165 170 175 

gaa ctt tat gaa aag aat ggt agg act cct att cgt gtg aaa aat ggg 

Glu Leu Tyr Glu Lys Asn Gly Arg Thr Pro He Arg Val Lys Asn Gly 

180 185 190 

gtg cat tct caa gat att gac get gca aaa cat tta gaa aca gat tea 

val His Ser Gin Asp He Asp Ala Ala Lys His Leu Glu Thr Asp Ser 

. 195 200 205 

tea ggg cat ate aga att tec ggg etc ate cat ggg gac tat gtc tta 

Ser Gly His He Arg He Ser Gly Leu He His Gly Asp Tyr Val Leu 

210 215 220 

aaa gaa ate gag aca cag tea gga tat cag ate gga cag gca gag act 

Lys Glu He Glu Thr Gin Ser Gly Tyr Gin He Gly Gin Ala Glu Thr 

225 230 235 240 

get gtg act att gaa aaa tea aaa aca gta aca gta acg att gaa aat 

Ala Val Thr He Glu Lys Ser Lys Thr Val Thr Val Thr He Glu Asn 

245 250 255 

aaa aaa gtt ccg aca cct aaa gtg cca tct cga gga ggt ctt att ccc 

Lys Lys Val Pro Thr Pro Lys Val Pro Ser Arg Gly Gly Leu He Pro 

260 265 270 
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aaa aca ggt gag caa cag 
Lys Thr Gly Glu Gin Gin 
275 

tta att get tta gcc tta 
Leu lie Ala Leu Ala Leu 
290 

aat aag gat tag 
Asn Lys Asp 
305 



gca atg gca ctt gta att 
Ala Met Ala Leu Val lie 

280 

cga tta eta tea aaa cat 
Arg Leu Leu Ser Lys His 
295 300 



att ggt ggt att 864 

lie Gly Gly He 

285 

egg aaa cat caa 912 
Arg Lys His Gin 



924 
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£aiaC 

atQ QQa caa aaa tea aaa ata tct eta get acg aat att cgt ata tgg 

Met Gly Gin Lys Ser Lys lie Ser Leu Ala Thr Aan lie Arg Il« Trp 

^ -f 5 10 13 

att ttt cgt tta att ttc tta gcg ggt ttc ctt gtt ttg gca tCt ccc 

lie She Arg Leu He Phe Leu Ala Gly Phe Leu Val Lea Ala Phe PCO 
20 25 30 

ate att agt cag gtc atg tac ttt caa gcc tct cac gcc aat att aat 

ill sir Gin val Met Tyr Phe Gin Ala Ser His Ala Aan He Asn 
35 

act ttt aaa gaa get gtt acc aag att gac egg gtg gag att aat egg 

III Phe Lys Glu Ala Val Thr Lya He Asp Arg Val Gla He Asn Arg 

SO 55 ' 60 

cat tta gaa ctt get tat get tat aac gcc agt ata gca ggt gcc aaa 

ftrg Leu Glu Leu Ala Tyr Ala Tyr Asn Ala Ser He Ala Gly Ala Lys 

€5 70 75 

act aat ggc gaa tat cca gcg ctt aaa gac ccc tac tct get gaa caa 

?hr Gly Glu Tyr Pro Ala Leu Lys Asp Pro Tyr Ser Ala Glu Glci 
S S 

aaa cag gca ggg gtc gtt gag tac gcc cgc atg ,ct,t gaa gtc aaa gaa 

111 Gin Ala Ily Val Val Glu Tyr Ala Arg Met L«a Glu Val Lys Glu 

^ . 100 105 

raa ata agt cat gtg att att eca aga att aat cag gat ate cct att 

Gin He Gly His Val He He Pro Arg He Asa Gin Asp He Pro He 

115 120 125 

tac act ggc tct get gaa gaa aat ctt cag agg ggc gtt gga eat tta 

Tyr Ala Gly Ser Ala Glu Glu Asn Leu Gin Arg Gly Val Gly Hxs Leu 
130 

gaq ggg acc agt ctt cca gtc ggt ggt gag tea act cat gcc gtt eta 

III III Thr Ser Leu Pro Val Gly Gly Glu Ser Thr His Ala Val Leu 

145 ISO 155 1«0 

act acc cat cga ggg eta cca acg gcc aag eta ttt acc aat tta gac 

?hr III Ss A?g Gly Leu Pro Thr Ala Lys Leu Phe Thr Asn Leu Asp 



165 



180 



210 



tac aca cet tat atg ata aat agt cat cgc etc etc gtt cga ggc aag 
CVS Thr Pro Tyr Met He Asn Ser His Arg Leu Leu Val Arg Gly Lys 
225 230 23S 240 

cga att cct tat gtg gaa aaa dca gtg cag aaa gat tea aag acc ttc 
Arg He Pro Tyr Val Glu Lys Thr Val Gin Lys Asp Ser Lys The Phe 
245 250 255 

agq caa caa caa tac eta acc tat get atg tgg gta gtc gtt gga ctt 
Arg Gin Gin Gin Tyr Leu Thr Tyr Ala Met Trp Val Val Val Gly Leu 
260 265 270 

ate ttg ctg teg ctt etc att tgg ttt aaa aag acg aaa cag aaa aag 
He Leu LGU Ser Leu Leu He Ttp She Lys Lys Thr Lys Gin Lys Lys 

275 280 285 

egg aga aag aat gaa aaa gcg get agt caa aat agt cac aat aat teg 
Arg Arg Lys Asn Glu Lys Ala Ala Ser Gin Asn Ser His Asn Asn Ser 
290 295 300 
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aaa ata aca gta ggt gac cgt ttt tac att gaa cac ate ggc gga aag 57 6 
III vTl vlx lly ASP Arg Phe Tyr He Glu His He Gly Gly Lys 

185 190 



art- oct tat cag gta gac caa ate aaa gtt ate gcc ect gat cag tta 624 
He Ala ^yr Gin val Lp Gin He. Lys Val He Ala Pro Asp Gin Leu 
jg5 200 205 

gag gat ttg tac gtg att caa gga gaa gat cac gtc acc eta tta act 672 
Glu Asp Leu Tyr Val He Gin Gly Glu Asp His Val Thr Leu Leu Thr 



aaa taa 

Lys 

305 



720 



768 



816 



864 



912 



918 



FIGURE 4 



5/10 



wo 02/12294 



PCT/USOl/24795 



atg aaa aag egg eta gtc aaa ata gtc aca ata att cga aat aat aaa 48 
Met Lys Lys Arg Leu VaX Lys He Val Thr He He Arg Asn Asn Lys 
15 10 15 

ate aga acc etc att ttt gtg atg gga agt ctg att etc tta ttt ccg 96 
Zle Arg Thr Leu He Phe Val Met Gly Ser Leu He Leu Leu Phe Pro 
20 25 30 

att gtg age cag gta agt tac tac ctt get teg cat caa aat att aat 144 
He Val Ser Gin Val Ser Tyr Tyr Leu Ala Ser His Gin Asn He Asn 
35 40 45 

caa ttt aag egg gaa gtc get aag att gat act aat acg gtt gaa ega 192 
Gin Phe Lys Arg Glu Val Ala Lys He Asp Thr Asn Thr Val Glu Arg 
50 55 60 

cge ate get tta get aat get tac aat gag acg tta tea agg aat ccc 240 
Arg He Ala Leu Ala Asn Ala Tyr Asn Glu Thr Leu Ser Arg Asn Pro 
65 70 75 80 



ttg ctt ata gac ect ttt acc agt aag caa aaa gaa ggt ttg aga gag 
Leu Leu He Asp Pro Phe Thr Ser Lys Gin LyW Glu Gly Leu Arg Glu 
85 90 95 



288 



tat get cgt atg ctt gaa gtt cat gag caa ata ggt cat gtg gca ate 336 
Tyr Ala Arg Met Leu Glu Val His Glu Gin He Gly His Val Ala He 
100 105 110 

cca agt at.t ggg gtt gat att cca att tat get gga aca tec gaa act* 384 
Pro Ser He Gly Val Asp He Pro He Tyr Ala Gly Thr Ser Glu Thr 
115 120 125 

gtg ctt cag aaa ggt agt ggg cat ttg gag gga ace agt ctt cca gtg 432 
Val Leu Gin Lys Gly Ser Gly His Leu Glu Gly Thr Ser Leu Pro Val 
130 135 140 

gga ggt ttg tea acc cat tea gta eta act gee cac cgt ggc ttg cca 480 
Gly Gly Leu Ser Thr His Ser Val Leu Thr Ala His Arg Gly Leu Pro 
145 150 155 160 

aca get agg eta ttt acc gac tta aat aaa gtt aaa aaa ggc cag att 528 
Thr Ala Arg Leu Phe Thr Asp Leu Asn Lys Val Lys Lys Gly Gin He 
165 170 175 

ttc tat gtg acg aac ate aag gaa aca ctt gcc tac aaa gtc gtg tet 57 6 
Phe Tyr Val Thr Asn He Lys Glu Thr Leu Ala Tyr Lys Val Val Ser 
180 185 190 

ate aaa gtt gtg gat cca aca get tta agt gag gtt aag att gtc aat 624 
He Lys Val Val Asp Pro Thr Ala Leu Ser Glu Val Lys He Val Asn 
195 200 205 

ggt aag gat tat ata acc ttg ctg act tgc aca cet tac atg ate aat 672 
Gly Lys Asp Tyr He Thr Leu Leu Thr Cys Thr Pro Tyr Met He Asn 
210 215 . 220 

agt eat cgt etc ttg gta aaa gga gag cgt att cet tat gat tct acc 720 
Ser His Arg Leu Leu Val Lys Gly Glu Arg He Pro Tyr Asp Ser Thr 
225 230 235 240 

gag gcg gaa aag cac aaa gaa caa acc gta caa gat tat cgt ttg tea 768 
Glu Ala Glu Lys His Lys Glu Gin Thr Val Gin Asp Tyr Arg Leu Ser 
245 250 255 

eta gtg ttg aag ata eta eta gta tta tta att gga etc ttc ate gtg 816 
Leu Val Leu Lys He Leu Leu Val Leu Leu He Gly Leu Phe He Val 
260 265 270 



ata atg atg aga aga tgg atg caa cat cgt caa taa 
He Met Met Arg Arg Trp Met Gin His Arg Gin 
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EmaE 

atg atg att gtg aat aat ggt tat eta gaa ggg aga aaa atg aaa aag 
Met Met lie Val Asn Asn Gly Tyr Leu Glu Gly Arg Lys Met Lys Lys 
15 10 15 

aga caa aaa ata tgg aga ggg tta tea gtt act tta eta ate ctg tec 
Arg Gin I»ys He Trp Arg Gly Leu Ser Val Thr Leu Leu He Leu Ser 
20 25 30 

caa att cca ttt ggt ata ttg gta caa ggt gaa acc caa gat acc aat 
Gin lie Pro Phe Gly He Leu Val Gin Gly Glu Thr Gin Asp Thr Asn 
35 40 45 

caa gca ctt gga aaa gta att gtt aaa aaa acg gga gac aat get aca 
Gin Ala Leu Gly Lys Val He Val Lys Lys Thr Gly Asp Asn. Ala Thr 
50 55 60 

cca tta ggc aaa gcg act ttt gtg tta aaa aat gac aat gat aag tea 
Pro Leu Gly Lys Ala Thr Phe Val Leu Lys Asn Asp Asn Asp Lys Ser 
65 ''O ''S ®° 

gaa aca agt cac gaa acg gta gag ggt tct gga gaa gca acc ttt gaa 
Glu Thr Ser His Glu Thr Val Glu Gly Ser Gly,,ALu Ala Thr Phe Glu 
85 90 95 



gaa caa tac aaa gca ttg aat cca ata aat gga aaa gat ggt cga aga 
Glu Gin Tyr Lys Ala Leu Asn Pro He Asn Gly Lys Asp Gly Arg Arg 
180 185 190 

gag att get gaa ggt tgg tta tea aaa aaa aat aca ggg gtc aat gat 
Glu He Ala Glu Gly Trp Leu Ser Lys Lys Asn Thr Gly Val Asn Asp 
195 200 205 

etc gat aag aat aaa tat aaa att gaa tta act gtt gag ggt aaa acc 
Leu Asp Lys Asn Lys Tyr Lys He Glu Leu Thr Val Glu Gly Lys Thr 

210 215 220 

act gtt gaa acg aaa gaa ctt aat caa cca eta gat gtc gtt gtg eta 
Thr Val Glu Thr Lys Glu Leu Asn Gin Pro Leu Asp Val Val Val Leu 
225 230 235 240 

tta gat aat tea aat agt atg aat aat- gaa aga gcc aat aat tct caa 
Leu Asp Asn Ser Asn Ser Met Asn Asn Glu Arg Ala Asn Asn Ser Gin 
245 250 255 

aga gca tta aaa get ggg gaa gca gtt gaa aag ctg att gat aaa att 
Arg Ala Leu Lys Ala Gly Glu Ala Val Glu Lys Leu He Asp Lys He 
260 265 270 

aca tea aat aaa gae aat aga gta get ctt gtg aca tat gcc tea acc 
Thr Ser Asn Lys Asp Asn Arg Val Ala Leu Val Thr Tyr Ala Ser Thr 



48 



96 



144 



192 



240 



288 



aac ata aaa cct gga gac tac aca tta aga gaa gaa aca gca cca att 336 
Asn He Lys Pro Gly Asp Tyr Thr Leu Arg Glu Glu Thr Ala Pro He 
100 105 110 

ggt tat aaa aaa act gat aaa acc tgg aaa gtt aaa gtt gca gat aac 384 

Gly Tyr Lys Lys Thr Asp Lys Thr Trp Lys Val Lys Val Ala Asp Asn 
115 120 125 

gga gca aca ata ate gag ggt atg gat gca gat aaa gca gag aaa cga 

Gly Ala Thr He He Glu Gly Met Asp Ala Asp Lys Ala Glu Lys Arg 
130 135 140 

aaa gaa gtt ttg aat gcc caa tat cca aaa tea get att tat gag gat 

Lys Glu Val Leu Asn Ala Gin Tyr Pro Lys Ser Ala He Tyr Glu Asp 

145 150 155 160 

aca aaa gaa aat tac cca tta gtt aat gta gag ggt. tec aaa gtt ggt 528 

Thr Lys Glu Asn Tyr Pro Leu Val Asn Val Glu Gly Ser Lys Val Gly 
165 1-70 175 



432 



480 



576 



624 



672 



720 



768 



816 



864 
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275 280 295 ■ 

att ttt gat ggt act gaa gcg acc gta tea aag gga gtt gcc gat caa 912 
lie Phe Asp Gly Thr Glu Ala Thr Val Ser Lys Gly Val Ala Asp Gin 
290 295 300 



aca aag aaa gtt tct gca acg aaa caa ate aaa act cat ggt gag cca 
Thr lys Lys val Ser Ala Thr tys Gin He Lys Thr His Gly Glu Pro 

515 520 525 



ttt act gtt ggg att ggt gta aac gga gat cct ggt gca act cot ctt 

Phe Thr Val Gly He Gly Val Asn Gly Asp Pro Gly Ala Thr Pro Leu 

545 550 555 560 

gaa get gag aaa ttt atg caa tea ata tea agt aaa aca gaa aat tat 



1008 



1056 



1104 



1152 



1200 



aat ggt aaa gcg ctg aat gat agt gta tea tgg gat tat cat aaa act 960 
Asn Gly Lys Ala Leu Asn Asp Ser Val Ser Trp Asp Tyr His Lys Thr 
305 310 315 320 

act ttt aca gca act aca cat aat tac agt tat tta aat tta aca aat 
Thr Phe Thr Ala Thr Thr His Asn Tyr Ser Tyr Leu Asn Leu Thr Asn 
325 330 335 

gat get aac gaa gtt aat att eta aag tea aga att cca aag gaa gcg 
Asp Ala Asn Glu Val Asn lie Leu Lys ser Arg He Pro Lys Glu Ala 
340 345 350 

gag cat ata aat ggg gat cgc acg etc tat caa ttt ggt gcg aca ttt 
Glu His He Asn Gly Asp Arg Thr Leu Tyr Gin Phe Gly Ala Thr Phe 

355 360 365 

act caa aaa get eta atg aaa gca aat gaa att^ ,tta gag aca caa agt 
Thr Gin Lys Ala Leu Met Lys Ala Asn Glu He Leu Glu Thr Gin Ser 
370 375 380 

tct aat get aga aaa aaa ctt att ttt cac gta act gat ggt gtc cct 
Ser Asn Ala Arg Lys Lys Leu He Phe His vai Thr Asp Gly Val Pro 
385 390 395 400 

acg atg tct tat gcc ata aat ttt aat cct tat ata tea aca tct tac 1248 
Thr Met Ser Tyr Ala He Asn Phe Asn Pro Tyr He Ser Thr Ser Tyr 
405 410 415 

caa aac cag ttt aat tct ttt tta aat aaa ata cca gat aga agt ggt 1296 
Gin Asn Gin Phe Asn Ser Phe Leu Asn Lys He Pro Asp Arg Ser Gly 
420 425 430 

att etc caa gag gat ttt ata ate aat ggt gat gat tat caa ata gta 
He Leu Gin Glu Asp Phe He He Asn Gly Asp Asp Tyr Gin He Val 
435 440 445 

aaa gga gat gga gag agt ttt aaa ctg ttt teg gat aga aaa gtt cct 
Lys Gly Asp Gly Glu Ser Phe Lys Leu Phe Ser Asp Arg Lys Val Pro 
450 455 460 

gtt act gga gga acg aca caa gca get tat cga gta ccg caa aat caa 
Val Thr Gly Gly Thr Thr Gin Ala Ala Tyr Arg Val Pro Gin Asn Gin 
465 470 475 480 

etc tct gta atg agt aat gag gga tat gca att aat agt gga tat att 
Leu Ser Val Met Ser Asn Glu Gly Tyr Ala He Asn Ser Gly Tyr He 
485 490 495 

tat etc tat tgg aga gat tac aac tgg gtc tat cca ttt gat cct aag 1536 
Tyr Leu Tyr Trp Arg Asp Tyr Asn Trp Val Tyr Pro Phe Asp Pro Lys 
500 505 510 



1344 



1392 



1440 



1488 



1584 



aca aca tta tac ttt aat gga aat ata aga cct aaa ggt tat gac att 1632 
Thr Thr Leu Tyr Phe Asn Gly Asn He Arg Pro Lys Gly Tyr Asp He 
530 535 540 



1680 



1728 
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1920 



Glu Ala Glu Lys Phe Met Gin Ser He Ser Ser Lys Thr Glu Asn Tyr 
565 570 575 

act aat gtt gat gat aca aat aaa att tat gat gag eta aat aaa tac 1776 
Thr Asn Val Asp Asp Thr Asn Lys lie Tyr Asp Glu Leu Asn Lys Tyr 
580 585 590 

ttt aaa aca att gtt gag gaa aaa cat tct att gtt gat gga aat gtg 1824 
Phe Lys Thr He Val Glu Glu Lys His Ser He Val Asp Gly Asn Val 

595 SOO 605 

act gat cct atg gga gag atg att gaa ttc caa tta aaa aat ggt caa 1872 
Thr Asp Pro Met Gly Glu Met He- Glu Phe Gin Leu Lys Asn Gly Gin 
610 615 620 

agt ttt aca cat gat gat tac gtt ttg gtt gga aat gat ggc agt caa 
ser Phe Thr His Asp Asp Tyr Val Leu Val Gly Asn Asp Gly Ser Gin 
625 630 635 640 

tta aaa aat ggt gtg get ctt ggt gga cca aac agt gat ggg gga att 196$ 
Leu Ly5 Asn Gly Val Ala Leu Gly Gly Pro Asn Ser Asp Gly Gly He 
64 5 650 655 

tta aaa gat gtt aca gtg act tat gat aag aca tct caa acc ate aaa 2016 
Leu Lys Asp Val Thr Val Thr Tyr Asp Lys Thr Ser Gin Thr He Lys 
660 665 670 

ate aat cat ttg aac tta gga agt gga caa aaa gta gtt ctt acc tat 
He Asn His Leu Asn Leu Gly Ser Gly Gin Lys Val Val Leu Thr Tyr 
675 6B0 685 

gat gta cgt tta aaa gat aac tat ata agt aac aaa ttt tac aat aca 
Asp Val Arg Leu Lys Asp Asn Tyr He Ser Asn Lys Phe Tyr Asn Thr 

690 695 700 

aat aat cgt aca acgr eta agt ccg aag agt gaa aaa gaa cca aat act 
Asn Asn Arg Thr Thr Leu Ser Pro Lys Ser Glu Lys Glu Pro Asn Thr 
705 710 715 720 

att cgt gat ttc cca att ccc aaa att cgt gat gtt cgt gag ttt ccg 
He Arg Asp Phe Pro He Pro Lys He Arg Asp Val Arg Glu Phe Pro 
725 730 735 

gta eta acc ate agt aat cag aag aaa atg ggt gag gtt gaa ttt att 2256 
Val Leu Thr He Ser Asn Gin Lys Lys Met Gly Glu Val Glu Phe He 
740 745 750 



2064 



2112 



2160 



2208 



aaa gtt aat aaa gac aaa cat tea gaa teg ctt ttg gga get aag ttt 2304 

Lys Val Asn Lys Asp Lys His Ser Glu Ser Leu Leu Gly Ala Lys Phe 
755 760 765 

caa ctt cag ata gaa aaa gat ttt tct ggg tat aag caa ttt gtt cca 2352 

Gin Leu Gin He Glu Lys Asp Phe Ser Gly Tyr Lys Gin Phe Val Pro 
770 775 780 

gag gga agt gat gtt aca aca aag aat gat ggt aaa att tat ttt aaa 2400 

Glu Gly Ser Asp Val Thr Thr Lys Asn Asp Gly Lys He Tyr Phe Lys 

785 790 795 800 

gca ctt caa gat ggt aac tat aaa tta tat gaa att tea agt cca gat 2448 

Ala Leu Gin Asp Gly Asn Tyr Lys Leu Tyr Glu He Ser Ser Pro Asp 

805 810 815 

ggc tat ata gag gtt aaa acg aaa cct gtt gtg aca ttt aca att caa 2496 

Gly Tyr He Glu Val Lys Thr Lys Pro Val Val Thr Phe Thr He Gin 
820 825 830 

aat gga gaa gtt acg aac ctg aaa gca gat cca aat get aat aaa aat 2544 

Asn Gly Glu Val Thr Asn Leu Lys Ala Asp Pro Asn Ala Asn Lys Asn 
835 840 845 
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caa ate ggg tat ctt gaa gga aat ggt 
Gin lie Gly Tyr Leu Glu Gly Asn Gly 
850 855 

ccc aaa cgc cca cca ggt gtt ttt cct 
Pro Lys Arg Pro Pro Gly Val Phe Pro 
865 870 

att gtc tat ata tta gtt ggt tct act 
He Val Tyr He Leu Val Gly Ser Thr 
885 

tct ttc cgt cgt aaa caa ttg taa 
Ser Phe Arg Arg Lys Gin Leu 
900 



aaa cat ctt att acc aac act 2592 
Lys His Leu rie Thr Asn Thr 
860 

aaa aca ggg gga att ggt aca 2640 
Lys Thr Gly Gly He Gly Thr 
875 880 

ttt atg ata ctt acc att tgt 2688 
Phe Met lie Leu Thr He Cya 
890 895 

2712 
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SEQUENCE LISTING 

<110> Adderson, Elisabeth 
Bohnsack, John 

<120> GROUP B STREPTOCOCCUS POYPEPTIDES NUCLEIC ACIDS AND 
THERAPEUTIC COMPOSITIONS AND VACCINES THEREOF 

<130> 2511-1-001 

<140> UNKNOWN 
<141> 2000-08-08 

<160> 37 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 737 

<212> DNA 

<213> Streptococcus agalactiae 
<400> 1 

atgacccttg ttaaaaatca agatgctctt gataaagcta ctgcaaatac agatgatgcg 60 
gcatttttgg aaattccagt tgcatcaact attaatgaaa aagcagtttt aggaaaagca 120 
attgaaaata cttttgaact tcaatatgac catactcctg ataaagctga caatccaaaa 180 
ccatctaatc ctccaagaaa accagaagtt catactggtg ggaaacgatt tgtaaagaaa 24 0 
gactcaacag aaacacaaac actaggtggt gctgagtttg atttgttggc ttctgatggg 300 
acagcagtaa aatggacaga tgctcttatt aaagcgaata ctaataaaaa ctatattgct 360 
ggagaagctg ttactgggca accaatcaaa ttgaaatcac atacagacgg tacgtttgag 420 
attaaaggtt tggcttatgc agttgatgcg aatgcagagg gtacagcagt aacttacaaa 480 
ttaaaagaaa caaaagcacc agaaggttat gtaatccctg ataaagaaat cgagtttaca 540 
gtatcacaaa catcttataa tacaaaacca actgacatca cggttgatag tgctgatgca 600 
acacctgata caattaaaaa caacaaacgt ccttcaatcc dtaatactgg tggtattggt 660 
acggctatct ttgtcgctat cggtgctgcg gtgatggctt ttgctgttaa ggggatgaag 720 
cgtcgtacaa aagataa 737 

1 
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<210> 2 
<211> 245 
• <212> P.RT 
<213> Streptococcus agalactiae 

<400> 2 

Met Thr Leu Val Lys Asn Gin Asp Ala Leu Asp Lys Ala Thr Ala Asn 
15 10 15 

Thr Asp Asp Ala Ala Phe Leu Glu lie Pro Val Ala Ser Thr lie Asn 
20 25 30 

Glu Lys Ala Val Leu Gly Lys Ala lie Glu Asn Thr Phe Glu Leu Gin 
35 40 45 

Tyr Asp His Thr Pro Asp Lys Ala Asp Asn Pro Lys Pro Ser Asn Pro 
50 55 ' 60 

Pro Arg Lys Pro Glu Val His Thr Gly Gly Lys Arg Phe Val Lys Lys 
65 70 75 80 

Asp Ser Thr Glu Thr Gin Thr Leu Gly Gly Ala Glu Phe Asp Leu Leu 
85 90 95 

Ala Ser Asp Gly Thr Ala Val Lys Trp Thr Asp Ala Leu lie Lys Ala 
100 105 110 

Asn Thr Asn Lys Asn Tyr lie Ala Gly Glu Ala Val Thr Gly Gin Pro 
115 120 125 

He Lys Leu Lys Ser His Thr Asp Gly Thr Phe Glu He Lys Gly Leu 
130 135 140 

Ala Tyr Ala Val Asp Ala Asn Ala Glu Gly Thr Ala Val Thr Tyr Lys 
145 150 ' 155 160 

Leu Lys Glu Thr Lys Ala Pro Glu Gly Tyr Val He Pro Asp Lys Glu 
165 170 175 

He Glu Phe Thr Val Ser Gin Thr Ser Tyr Asn Thr Lys Pro Thr Asp 
180 185 190 

He Thr Val Asp Ser Ala Asp Ala Thr Pro Asp Thr He Lys Asn Asn 
195 200 205 

Lys Arg Pro Ser He Pro Asn Thr Gly Gly He Gly Thr Ala He Phe 
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.210 215 220 

Val Ala He Gly Ala Ala Val Met Ala Phe Ala Val Lys Gly Met Lys 
225 230 235 240 

Arg Arg Thr Lys Asp 
245 

<210> 3 
<211> 924 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 3 

atgaaacaaa cattaaaact tatgttttct tttctgttga tgttagggac tatgtttgga 60 
attagccaaa ctgttttagc gcaagaaact catcagttga cgattgttca tcttgaagca 120 
agggatattg atcgtccaaa tccacagttg gagattgccc ctaaagaagg gactccaatt 180 
gaaggagtac tctatcagtt gtaccaatta aaatcaactg aagatggcga tttgttggca 24 0 
cattggaatt ccctaactat cacagaattg aaaaaacagg cgcagcaggt ttttgaagcc 300 
actactaatc aacaaggaaa ggctacattt aaccaactac cagatggaat ttattatggt 360 
ctggcggtta aagccggtga aaaaaatcgt aatgtctcag ctttcttggt tgacttgtct 420 
gaggataaag tgatttatcc taaaatcatc tggtccacag gtgagttgga cttgcttaaa 480 
gttggtgtgg atggtgatac caaaaaacca ctagcaggcg ttgtctttga actttatgaa 540 
aagaatggta ggactcctat tcgtgtgaaa aatggggtgc attctcaaga tattgacgct 600 
gcaaaacatt tagaaacaga ttcatcaggg catatcagaa tttccgggct catccatggg 660 
gactatgtct taaaagaaat cgagacacag tcaggatatc agatcggaca ggcagagact 720 
gctgtgacta ttgaaaaatc aaaaacagta acagtaacga ttgaaaataa aaaagttccg 780 
acacctaaag tgccatctcg aggaggtctt attcccaaaa caggtgagca acaggcaatg 840 
gcacttgtaa ttattggtgg tattttaatt gctttagcct tacgattact atcaaaacat 900 
cggaaacatc aaaataagga ttag 924 
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<210> 4 
<211> 307 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 4 

Met Lys Gin Thr Leu Lys Leu Met Phe Ser Phe Leu Leu Met Leu Gly 
15 10 15 

Thr Met Phe Gly lie Ser Gin Thr Val Leu Ala Gin Glu Thr His Gin 
20 25 30 

Leu Thr lie Val His Leu Glu Ala Arg Asp lie Asp Arg Pro Asn Pro 
35 40 45 

Gin Leu Glu lie Ala Pro Lys Glu Gly Thr Pro lie Glu Gly Val Leu 
50 55 60 

Tyr Gin Leu Tyr Gin Leu Lys Ser Thr Glu Asp Gly Asp Leu Leu Ala 
65 70 75 80' 

His Trp Asn Ser Leu Thr lie Thr Glu Leu Lys Lys Gin Ala Gin Gin 
85 90 95 

Val Phe Glu Ala Thr Thr Asn Gin Gin Gly Lys Ala Thr Phe Asn Gin 
100 105 110 

Leu Pro Asp Gly lie Tyr Tyr Gly Leu Ala Val Lys Ala Gly Glu Lys 
115 120 125 

Asn Arg Asn Val Ser Ala Phe Leu Val Asp Leu Ser Glu Asp Lys Val 
130 135 140 

lie Tyr Pro Lys lie lie Trp Ser Thr Gly Glu Leu Asp Leu Leu Lys 
145 150 155 160 

Val Gly Val Asp Gly Asp Thr Lys Lys Pro Leu Ala Gly Val Val Phe 
165 170 175 

Glu Leu Tyr Glu Lys Asn Gly Arg Thr Pro lie Arg Val Lys Asn Gly 
180 185 190 

Val His Ser Gin Asp lie Asp Ala Ala Lys His Leu Glu Thr Asp Ser 
195 200 205 

Ser Gly His lie Arg lie Ser Gly Leu lie His Gly Asp Tyr Val Leu 
210 215 220 
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Lys Glu lie Glu Thr Gin Ser Gly Tyr Gin He Gly Gin Ala Glu Thr 
225 230 235 240 

Ala Val Thr He Glu Lys Ser Lys Thr Val Thr Val Thr He Glu Asn 

245 ' 250 255 

Lys Lys Val Pro Thr Pro Lys Val Pro Ser Arg Gly Gly Leu He Pro 
260 265 270 

Lys Thr Gly Glu Gin Gin Ala Met Ala Leu Val He He Gly Gly He 
275 280 285 

Leu He Ala Leu Ala Leu Arg Leu Leu Ser Lys His Arg Lys His Gin 
290 295 300 

Asn Lys Asp 
305 



<210> 5 
<211> 918 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 5 

atgggacaaa aatcaaaaat atctctagct acgaatattc gtatatggat ttttcgttta 60 
attttcttag cgggtttcct tgttttggca tttcccatcg ttagtcaggt catgtacttt 120 
caagcctctc acgccaatat taatgctttt aaagaagctg ttaccaagat tgaccgggtg 180 
gagattaatc ggcgtttaga acttgcttat gcttataacg ccagtatagc aggtgccaaa 240 
actaatggcg aatatccagc gcttaaagac ccctactctg ctgaacaaaa gcaggcaggg 300 
gtcgttgagt acgcccgcat gcttgaagtc aaagaacaaa taggtcatgt gattattcca 360 
agaattaatc aggatatccc tatttacgct ggctctgctg aagaaaatct tcagaggggc 420 
gttggacatt tagaggggac cagtcttcca gtcggtggtg agtcaactca tgccgttcta 480 
actgcccatc gagggctacc aacggccaag ctatttacca atttagacaa ggtaacagta 54 0 
ggtgaccgtt tttacattga acacatcggc ggaaagattg cttatcaggt agaccaaatc 600 
aaagttatcg, cccctgatca gttagaggat ttgtacgtga ttcaaggaga agatcacgtc 660 
accctattaa cttgcacacc ttatatgata aatagtcatc gcctcctcgt tcgaggcaag 720 

5 
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cgaattcctt atgtggaaaa aacagtgcag aaagattcaa agaccttcag gcaacaacaa 780 
tacctaacct atgctatgtg ggtagtcgtt ggacttatct tgctgtcgct tctcatttgg 840 
tttaaaaaga cgaaacagaa aaagcggaga aagaatgaaa aagcggctag tcaaaatagt 900 

918 

cacaataatt cgaaataa 



<210> 6 
<211> 305 

<212> PRT 

<213> Streptococcus agalactiae 
<400> 6 

Met Gly Gin Lys Ser Lys He Ser Leu Ala Thr Asn He Arg He Trp 
1 5 10 15 

He Phe Arg Leu He Phe Leu Ala Gly Phe Leu Val Leu Ala Phe Pro' 
20 25 30 

He Val Ser Gin Val Met Tyr Phe Gin Ala Ser His Ala Asn He Asn 
35 40 45 

Ala Phe Lys Glu Ala Val Thr Lys He Asp Arg Val Glu He Asn Arg 
50 55 60 

Arg Leu Glu Leu Ala Tyr Ala Tyr Asn Ala Ser He Ala Gly Ala Lys 
65 70 75 80 

Thr Asn Gly Glu Tyr Pro Ala Leu Lys Asp Pro Tyr Ser Ala Glu Gin 
85 90 95 

Lys Gin Ala Gly Val Val Glu Tyr Ala Arg Met Leu Glu Val Lys Glu 
100 105 110 

Gin He Gly His Val He He Pro Arg He Asn Gin Asp He Pro He 
115 120 125 

Tyr Ala Gly Ser Ala Glu Glu Asn Leu Gin Arg Gly Val Gly His Leu 
130 135 140 

Glu Gly Thr Ser Leu Pro Val Gly Gly Glu Ser Thr His Ala Val Leu 
145 150 155 160 

Thr Ala His Arg Gly Leu Pro Thr Ala Lys Leu Phe Thr Asn Leu Asp 
165 170 175 
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Lys Val Thr Val Gly Asp Arg Phe Tyr lie Glu His He Gly Gly Lys 
180 185 190 

He Ala Tyr Gin Val Asp Gin He Lys Val He Ala Pro Asp Gin Leu 
195 200 205 

Glu Asp Leu Tyr Val He Gin Gly Glu Asp His Val Thr Leu Leu Thr 
210 215 220 

Cys Thr Pro Tyr Met He Asn Ser His Arg Leu Leu Val Arg Gly Lys 
225 230 235 240 

Arg He Pro Tyr Val Glu Lys Thr Val Gin Lys Asp Ser Lys Thr Phe 
245 250 255 

Arg Gin Gin Gin Tyr Leu Thr Tyr Ala Met Trp Val Val Val Gly Leu 
260 265 270 

He Leu Leu Ser Leu Leu He Trp Phe Lys Lys Thr Lys Gin Lys Lys^ 
275 280 285 

Arg Arg Lys Asn Glu Lys Ala Ala Ser Gin Asn Ser His Asn Asn Ser 
290 295 300 

Lys 
305 



<210> 7 
<211> 852 
<212> DNA 

<213> Streptococcus agalactiae 
<400> 7 

atgaaaaagc ggctagtcaa aatagtcaca ataattcgaa ataataaaat cagaaccctc 60 
atttttgtga tgggaagtct gattctctta tttccgattg tgagccaggt aagttactac 120 
cttgettcgc atcaaaatat taatcaattt aagcgggaag tcgctaagat tgatactaat 180 
acggttgaac gacgcatcgc tttagctaat gcttacaatg agacgttatc aaggaatccc 240 
ttgcttatag acccttttac cagtaagcaa aaagaaggtt tgagagagta tgctcgtatg 300 
cttgaagttc atgagcaaat aggtcatgtg gcaatcccaa gtattggggt tgatattcca 360 
atttatgctg gaacatccga aactgtgctt cagaaaggta gtgggcattt ggagggaacc 4 20 

7 
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agtcttccag tgggaggttt gtcaacccat tcagtactaa ctgcccaccg tggcttgcca 480 

acagctaggc tatttaccga cttaaataaa gttaaaaaag gccagatttt ctatgtgacg 540 

aacatcaagg aaacacttgc ctacaaagtc gtgtctatca aagttgtgga tccaacagct 600 

ttaagtgagg ttaagattgt caatggtaag gattatataa ccttgctgac ttgcacacct 660 

tacatgatca atagtcatcg tctcttggta aaaggagagc gtattcctta tgattctacc 720 

gaggcggaaa agcacaaaga acaaaccgta caagattatc gtttgtcact agtgttgaag 780 

atactactag tattattaat tggactcttc atcgtgataa tgatgagaag atggatgcaa 840 

catcgtcaat aa 852 



<210> 8 

<211> 283 
<212> PRT 

<213> Streptococcus agalactiae 

<400> 8 

Met Lys Lys Arg Leu Val Lys lie Val Thr lie lie Arg Asn Asn Lys 
15 10 15 

He Arg Thr Leu He Phe Val Met Gly Ser Leu He Leu Leu Phe Pro 
20 25 30 

He Val Ser Gin Val Ser Tyr Tyr Leu Ala Ser His Gin Asn He Asn 
35 40 45 

Gin Phe Lys Arg Glu Val Ala Lys He Asp Thr Asn Thr Val Glu Arg 
50 55 60 

Arg He Ala Leu Ala Asn Ala Tyr Asn Glu Thr Leu Ser Arg Asn Pro 
65 70 75 80 

Leu Leu He Asp Pro Phe Thr Ser Lys Gin Lys Glu Gly Leu Arg Glu 
85 90 95 

Tyr Ala Arg Met Leu Glu Val His Glu Gin He Gly His Val Ala He 
100 105 110 

Pro Ser He Gly Val Asp He Pro He Tyr Ala Gly Thr Ser Glu Thr 
115 120 125 
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Val Leu Gin Lys Giy Ser Gly His Leu Glu Gly Thr Ser Leu Pro Val 
130 135 140 

Gly Gly Leu Ser Thr His Ser Val Leu Thr Ala His Arg Gly Leu Pro 
145 150 155 160 

Thr Ala Arg Leu Phe Thr Asp Leu Asn Lys Val Lys Lys Gly Gin lie 
165 170 175 

Phe Tyr Val Thr Asn lie Lys Glu Thr Leu Ala Tyr Lys Val Val Ser 
180 185 190 

lie Lys Val Val Asp Pro Thr Ala Leu Ser Glu Val Lys lie Val Asn 
195 200 205 

Gly Lys Asp Tyr lie Thr Leu Leu Thr Cys Thr Pro Tyr Met lie Asn 
210 215 220 

Ser His Arg Leu Leu Val Lys Gly Glu Arg lie Pro Tyr Asp Ser Thr 
225 230 235 240 

Glu Ala Glu Lys His Lys Glu Gin Thr Val Gin Asp Tyr Arg Leu Ser 
245 250 255 

Leu Val Leu Lys lie Leu Leu Val Leu Leu lie Gly Leu Phe lie Val 
260 265 270 

He Met Met Arg Arg Trp Met Gin His Arg Gin 
275 280 



<210> 9 
<211> 2712 
<212> DNA 

<213> Streptococcus agalactiae 

<400> 9 

atgatgattg tgaataatgg ttatctagaa gggagaaaaa tgaaaaagag acaaaaaata 60 
tggagagggt tatcagttac tttactaatc ctgtcccaaa ttccatttgg tatattggta 120 
caaggtgaaa cccaagatac caatcaagca cttggaaaag taattgttaa aaaaacggga 180 
gacaatgcta caccattagg caaagcgact tttgtgttaa "aaaatgacaa tgataagtca 240 
gaaacaagtc acgaaacggt agagggttct ggagaagcaa cctttgaaaa cataaaacct 300 
ggagactaca cattaagaga agaaacagca ccaattggtt ataaaaaaac tgataaaacc 360 

9 
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tggaaagtta aagttgcaga taacggagca acaataatcg agggtatgga tgcagataaa 420 
gcagagaaac gaaaagaagt tttgaatgcc caatatccaa aatcagctat ttatgaggat 480 
acaaaagaaa attacccatt agttaatgta gagggttcca aagttggtga acaatacaaa 540' 
gcattgaatc caataaatgg aaaagatggt cgaagagaga ttgctgaagg ttggttatca 600 
aaaaaaaata caggggtcaa tgatctcgat aagaataaat ataaaattga attaactgtt 660 
gagggtaaaa ccactgttga aacgaaagaa cttaatcaac cactagatgt cgttgtgcta 720 
ttagataatt caaatagtat gaataatgaa agagccaata attctcaaag agcattaaaa 780 
gctggggaag cagttgaaaa gctgattgat aaaattacat caaataaaga caatagagta 840 
gctcttgtga catatgcctc aaccattttt gatggtactg aagcgaccgt atcaaaggga 900 
gttgccgatc aaaatggtaa agcgctgaat gatagtgtat catgggatta tcataaaaot 960 
acttttacag caactacaca taattacagt tatttaaatt taacaaatga' tgctaacgaa 1020 
gttaatattc taaagtcaag aattccaaag gaagcggagc atataaatgg ggatcgcacg 1080 
ctctatcaat ttggtgcgac atttactcaa aaagctctaa tgaaagcaaa tgaaatttta 1140 
gagacacaaa gttctaatgc tagaaaaaaa cttatttttc acgtaactga tggtgtccct 1200 
acgatgtctt atgccataaa ttttaatcct tatatatcaa catcttacca aaaccagttt 1260 
aattcttttt taaataaaat accagataga agtggtattc tccaagagga ttttataatc 1320 
aatggtgatg attatcaaat agtaaaagga gatggagaga gttttaaact gttttcggat 1380 
agaaaagttc ctgttactgg aggaacgaca caagcagctt atcgagtacc gcaaaatcaa 1440 
ctctctgtaa tgagtaatga gggatatgca attaatagtg gatatattta tctctattgg 1500 
agagattaca actgggtcta tccatttgat cctaagacaa agaaagtttc tgcaacgaaa 1560 
caaatcaaaa ctcatggtga gccaacaaca ttatacttta atggaaatat aagacctaaa 1620 
ggttatgaca tttttactgt tgggattggt gtaaacggag atcctggtgc aactcctctt 1680 
gaagctgaga aatttatgca atcaatatca agtaaaacag aaaattatac taatgttgat 1740 
gatacaaata aaatttatga tgagctaaat aaatacttta aaacaattgt tgaggaaaaa 1800 
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cattctattg 


ttgatggaaa 


tgtgactgat 


cctatgggag agatgattga 


attccaatta 


1860 


aaaaatggtc 


aaagttttac 


acatgatgat 


tacgttttgg 


ttggaaatga 


tggcagtcaa 


1920 


ttaaaaaatg 


gtgtggctct 


tggtggacca 


aacagtgatg 


ggggaatttt 


aaaagatgtt 


1980 


acagtgactt 


atgataagac 


atctcaaacc 


atcaaaatca 


atcatttgaa 


cttaggaagt 


2040 


ggacaaaaag 


tagttcttac 


ctatgatgta 


cgtttaaaag 


ataactatat 


aagtaacaaa 


2100 


ttttacaata 


caaataatcg 


tacaacgcta 


agtccgaaga 


gtgaaaaaga 


accaaatact 


2160 


attcgtgatt 


tcccaattcc 


caaaattcgt 


gatgttcgtg 


agtttccggt 


actaaccatc 


2220 


agtaatcaga 


agaaaatggg 


tgaggttgaa 


tttattaaag 


ttaataaaga 


caaacattca 


2280 


gaatcgcttt 


tgggagctaa 


gtttcaactt 


cagatagaaa 


aagatttttc 


tgggtataag 


2340 


caatttgttc 


cagagggaag 


tgatgttaca 


acaaagaatg 


atggtaaaat 


ttattttaaa 


2400 


gcacttcaag 


atggtaacta 


taaattatat 


gaaatttcaa 


gtccagatgg 


ctatatagag 


2460 


gttaaaacga 


aacctgttgt 


gacatttaca 


attcaaaatg 


gagaagttac 


gaacctgaaa 


2520 


gcagatccaa 


atgctaataa 


aaatcaaatc 


gggtatcttg 


aaggaaatgg 


taaacatctt 


2580 


attaccaaca 


ctcccaaacg 


cccaccaggt 


gtttttccta 


aaacaggggg 


aattggtaca 


2640 


attgtctata 


tattagttgg 


ttctactttt 


atgatactta 


ccatttgttc 


tttccgtcgt 


2700 


aaacaattgt 


aa 










2712 



<210> 10 
<211> 903 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 10 

Met Met lie Val Asn Asn Gly Tyr Leu Glu Gly Arg Lys Met Lys Lys 
15 10 15 

Arg Gin Lys lie Trp Arg Gly Leu Ser Val Thr Leu Leu lie Leu Ser 
20 25 30 

Gin lie Pro Phe Gly lie Leu Val Gin Gly Glu Thr Gin Asp Thr Asn 
35 40 45 
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Gin Ala Leu Gly Lys 
50 

Pro Leu Gly Lys Ala 
65 

Glu Thr Ser His Glu 
85 

Asn lie Lys Pro Gly 
100 

Gly Tyr Lys Lys Thr 
115 

Gly Ala Thr He He 
130 

Lys Glu Val Leu Asn 
145 

Thr Lys Glu Asn Tyr 
165 

Glu Gin Tyr Lys Ala 
180 

Glu He Ala Glu Gly 
195 

Leu Asp Lys Asn Lys 
210 

Thr Val Glu Thr Lys 
225 

Leu Asp Asn Ser Asn 
245 

Arg Ala Leu Lys Ala 
260 

Thr Ser Asn Lys Asp 
275 

He Phe Asp Gly Thr 
290 



Val He Val Lys Lys Thr 
55 

Thr Phe Val Leu Lys Asn 
70 75 

Thr Val Glu Gly Ser Gly 
90 

Asp Tyr Thr Leu Arg Glu 
105 

Asp Lys Thr Trp Lys Val 
120 

Glu Gly Met Asp Ala Asp 
135 

Ala Gin Tyr Pro Lys Ser 
150 155 

Pro Leu Val Asn Val Glu 
170 

Leu Asn Pro He Asn Gly 
185 

Trp Leu Ser Lys Lys Asn 
200 

Tyr Lys He Glu Leu Thr 
215 

Glu Leu Asn Gin Pro Leu 
230 235 

Ser Met Asn Asn Glu Arg 
250 

Gly Glu Ala Val Glu Lys 
265 

Asn Arg Val Ala Leu Val 
280 

Glu Ala Thr Val Ser Lys 
295 



Gly Asp Asn Ala Thr 
60 

Asp Asn Asp Lys Ser 
80 

Glu Ala Thr Phe Glu 
95 

Glu Thr Ala Pro He 
110 

Lys Val Ala Asp Asn 
125 

Lys Ala Glu Lys Arg 
140 

Ala He Tyr Glu Asp 
160 

Gly Ser Lys Val Gly 
175 

Lys Asp Gly Arg Arg 
190 

Thr Gly Val Asn Asp 
205 

Val Glu Gly Lys Thr 
220 

Asp Val Val Val Leu 
240 

Ala Asn Asn Ser Gin 
255 

Leu He Asp Lys He 

270 

Thr Tyr Ala Ser Thr 
285 

Gly Val Ala Asp Gin 
300 
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Asn Gly Lys Ala Leu Asn Asp Ser Val Ser Trp Asp Tyr His Lys Thr 

310 315 320 

Thr Phe Thr Ala Thr Thr His Asn Tyr Ser Tyr Leu Asn Leu Thr Asn 
325, , 330 335 

Asp Ala Asn Glu Val Asn He Leu Lys Ser Arg He Pro Lys Glu Ala 
340 345 

Glu His He Asn Gly Asp Arg Thr Leu Tyr Gin Phe Gly Ala Thr Phe 
355 360 365 

Thr Gin Lys Ala Leu Met Lys Ala Asn Glu He Leu Glu Thr Gin Ser 
370 375 

Ser Asn Ala Arg Lys Lys Leu He Phe His Val Thr Asp Gly Val Pro 

390 395 400 

Thr Met Ser Tyr Ala He Asn Phe Asn Pro Tyr He Ser Thr Ser Tyr> 
405 410 _ 415 

Gin Asn Gin Phe Asn Ser Phe Leu Asn Lys He Pro Asp Arg Ser Gly 
420 425 430 

He Leu Gin Glu Asp Phe He He Asn Gly Asp Asp Tyr Gin He Val 
435 440 445 

Lys Gly Asp Gly Glu Ser Phe Lys Leu Phe Ser Asp Arg Lys Val Pro 
450 455 460 

Val Thr Gly Gly Thr Thr Gin Ala Ala Tyr Arg Val Pro Gin Asn Gin 

470 475 480 

Leu Ser Val Met Ser Asn Glu Gly Tyr Ala He Asn Ser Gly Tyr He 
485 490 495 

Tyr Leu Tyr Trp Arg Asp Tyr Asn Trp Val Tyr Pro Phe Asp Pro Lys 
500 505 510 

Thr Lys Lys Val Ser Ala Thr Lys Gin He Lys Thr His Gly Glu Pro 
515 520 525 

Thr Thr Leu Tyr Phe Asn Gly Asn He Arg Pro Lys Gly Tyr Asp He 
530 535 540 

Phe Thr Val Gly He Gly Val Asn Gly Asp Pro Gly Ala Thr Pro Leu 

550 555 560 
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Glu Ala Glu Lys Phe Met Gin Ser lie Ser Ser Lys Thr Glu Asn Tyr 
565 570 575 

Thr Asn Val Asp Asp Thr Asn Lys lie Tyr Asp Glu Leu Asn Lys Tyr 
580 585 590 

Phe Lys Thr He Val Glu Glu Lys His Ser He Val Asp Gly Asn Val 
. 600 605 

Thr Asp Pro Met Gly Glu Met He Glu Phe Gin Leu Lys Asn Gly Gin 
610 615 620 

Ser Phe Thr His Asp Asp Tyr Val Leu Val Gly Asn Asp Gly Ser Gin 
625 630 635 640 

Leu Lys Asn Gly Val Ala Leu Gly Gly Pro Asn Ser Asp Gly Gly He 
645 650 655 

Leu Lys Asp Val Thr Val Thr Tyr Asp Lys Thr Ser Gin Thr He Lys- 
660 665 670 

r ~ 

He Asn His Leu Asn Leu Gly Ser Gly Gin Lys Val Val Leu Thr Tyr 
675 680 685 

Asp Val Arg Leu Lys Asp Asn Tyr He Ser Asn Lys Phe Tyr Asn Thr 
690 695 700 

Asn Asn Arg Thr Thr Leu Ser Pro Lys Ser Glu Lys Glu Pro Asn Thr 
705 710 715 720 

He Arg Asp Phe Pro He Pro Lys He Arg Asp Val Arg Glu Phe Pro 
725 730 735 

Val Leu Thr He Ser Asn Gin Lys Lys Met Gly Glu Val Glu Phe He 
740 745 750 

Lys Val Asn Lys Asp Lys His Ser Glu Ser Leu Leu Gly Ala Lys Phe 
755 760 765 



Gin Leu Gin He Glu Lys Asp Phe 
770 775 

Glu Gly Ser Asp Val Thr Thr Lys 
785 790 

Ala Leu Gin Asp Gly Asn Tyr Lys 
805 



Ser Gly Tyr Lys Gin Phe Val Pro 
780 

Asn Asp Gly Lys He Tyr Phe Lys 

795 800 

Leu Tyr Glu He Ser Ser Pro Asp 
810 815 
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Gly Tyr lie Glu Val Lys Thr Lys Pro Val Val Thr Phe Thr lie Gin 
820 825 830 

Asn Gly Glu Val Thr Asn Leu Lys Ala Asp Pro Asn Ala Asn Lys Asn 
835 840 845 

Gin lie Gly Tyr Leu Glu Gly Asn Gly Lys His Leu lie Thr Asn Thr 
850 855 860 

' Pro Lys Arg Pro Pro Gly Val Phe Pro Lys Thr Gly Gly lie Gly Thr 
865 870 875 880 

lie Val Tyr lie Leu Val Gly Ser Thr Phe Met lie Leu Thr lie Cys 
885 890 895 

Ser Phe Arg Arg Lys Gin Leu 
900 



<210> 11 
<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 11 

ctaggtggat ccttcggcaa t 21 



<21G> 12 

<211> 10 . . ' 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 12 

cgattgccga 10 



<210> 13 
<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 13 

aggcaactgt gctaaccgag ggaat 



<210> 14 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 14 
cgattccctc g 



<210> 15 
<211> 1509 
<212> DNA 

<213> Streptococcus agalactiae 



<220> 

<221> CDS 

<222> (1) , . (1509) 

<400> 15 

atg aaa aag aaa atg att caa teg ctg tta gtg gcg agt tta gca ttt 48 ' 

Met Lys Lys Lys Met He Gin Ser Leu Leu Val Ala Ser Leu Ala Phe 
^ 5 10 15 



ggt atg get gta 
Gly Met Ala Val 
20 

ggg aca att aca 
Gly Thr He Thr 
35 

tat aaa gtt ttt 
Tyr Lys Val Phe 



tea cca gtt acg ccg 
Ser Pro Val Thr Pro 
25 

gtt caa gat act caa 
Val Gin Asp Thr Gin 
40 

gat gca gaa ata gat 
Asp Ala Glu He Asp 



ata get ttt gee 
He Ala Phe Ala 



aaa gge gca ace 
Lys Gly Ala Thr 
45 

aat gca aat gta 
Asn Ala Asn Val 



get gag aca 96 
Ala Glu Thr 
30 

tat aaa gca 144 
Tyr Lys Ala 



tot gat teg 192 
Ser Asp Ser 
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50 55 60 

aat aaa gat gga get tct tat tta att cct caa ggt aaa gaa get gag 240 

Asn Lys Asp Gly Ala Ser Tyr Leu lie Pro Gin Gly Lys Glu Ala Glu 

65 70 75 80 

tat aaa get tea act gat ttt aat tct ett ttt acg aea act act aat 288 

Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 

85 90 95 



gga ggg aga aea tat gta act aaa aaa gat act geg tea gea aat gag 336 
Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 
100 105 110 



att geg aea tgg get aaa tct ata tea get aat act aea cca gtt tec 38 4 
lie Ala Thr Trp Ala Lys Ser lie Ser Ala Asn Thr Thr Pro Val Ser 
115 120 125 



act gtt act gag tea aat aat gat ggt act gag gtt att aat gtt tec 432 

Thr Val Thr Glu Ser Asn Asn Asp Gly Thr Glu Val He Asn Val Ser-» 
130 135 140 

caa tat gga tat tat tat gtt tct age act gtt aat aat gga get gta 480 

Gin Tyr Gly Tyr Tyr Tyr Val Ser Ser Thr Val Asn Asn Gly Ala Val 

145 150 155 160 



att atg gtt aea tct gta act cca aat get act att cat gaa aag aat 528 
He Met Val Thr Ser Val Thr Pro Asn Ala Thr He His Glu Lys Asn 
165 170 175 



act gat geg aea tgg gga gat ggt ggt gga aaa act gta gat caa aaa 57 6 
Thr Asp Ala Thr Trp Gly Asp Gly Gly Gly Lys Thr Val Asp Gin Lys 
180 185 190 



acg tac teg gtt ggt gat aea gtc aaa tat act att act tat aag aat 624 
Thr Tyr Ser Val Gly Asp" Thr Val Lys Tyr Thr He Thr Tyr Lys Asn 
195 200 205 



gea gtc aat tat cat ggt aea gaa aaa gtg tat caa tat gtt ata aag 672 
Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val He Lys 
210 215 220 



gat act atg cca tct get tct gta gtt gat ttg aac gaa ggg tct tat 720 
Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr 
225 230 235 240 



gaa gta act att act gat gga tea ggg aat att aca act eta act caa 7 68 
Glu Val Thr He Thr Asp Gly Ser Gly Asn He Thr Thr Leu Thr Gin 
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245 250 255 

ggt teg gaa aaa gca act ggg aag tat aac ctg tta gag gaa aat aat 816 

Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 
260 265 270 

aat ttc acg att act att ccg tgg gca get acc aat act cca acc gga 8 64 

Asn Phe Thr lie Thr lie Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 
275 280 285 

aat act caa aat gga get aat gat gac ttt ttt tat aag gga ata aat 912 

Asn Thr Gin Asn Gly Ala Asn Asp Asp Phe Phe Tyr Lys Gly He Asn 
290 295 300 

aca ate aca gte act tat aca gga gta tta aag agt gga get aaa cca 960 

Thr He Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 

305 310 315 320 

ggt tea get gat tta cca gaa aat aca aac att gcg acc ate aac ccc 1008 

Gly Ser Ala Asp- Leu Pro Glu Asn Thr Asn He Ala Thr He Asn Pro ^ 
325 330 ^ 335 

aat act age aat gat gac cca ggt caa aaa gta aca gtg agg gat ggt 1056 

Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 
340 345 350 

caa att act ata aaa aaa att gat ggt tec aca' aaa get tea tta caa 1104 

Gin He Thr He Lys Lys He Asp Gly Ser Thr Lys Ala Ser Leu Gin 
355 360 365 

ggt get ata ttt gtt tta aag aat get acg ggt caa ttt eta aac ttt 1152 

Gly Ala He Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 
370 375 380 

aac gat aca aat aac gtt gaa tgg gge aca gaa get aat gca aca gaa 1200 

Asn Asp Thr Asn Asn Val Glu Trp Gly Thr Glu Ala Asn Ala Thr Glu 

385 390 395 400 

tat aca aca gga gca gat ggt ata att ace att aca gge ttg aaa gaa 1248 

Tyr Thr Thr Gly Ala Asp Gly He He Thr He Thr Gly Leu Lys Glu 
405 410 415 

ggt aca tac tat eta gtt gag aaa aag get ccc tta ggt tac aat ttg 1296 

Gly Thr Tyr Tyr Leu Val Glu Lys Lys Ala Pro Leu Gly Tyr Asn Leu 
420 425 430 

tta gat aac tct cag aag gtt att tta gga gat gga gee act gat acg 1344 

Leu Asp Asn Ser Gin Lys Val He Leu Gly Asp Gly Ala Thr Asp Thr 
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435 440 445 

act aat tea gat aac ctt tta gtt aac cca act gtt gaa aat aac aaa 1392 
Thr Asn Ser Asp Asn Leu Leu Val Asn Pro Thr Val Glu Asn Asn Lys 
450 455 460 

ggt act gag ttg cct tea aca ggt ggt att ggt aca aca att ttc tac 1440 
Gly Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr Thr lie Phe Tyr 
465 470 475 480 

att ata ggt gca att tta gta ata gga gca ggt ate gtg ctt gtt get 1488 
He He Gly Ala He Leu Val He Gly Ala Gly He Val Leu Val Ala 
485 490 495 

cgt cgt cgt tta cgt tct taa 1509 
Arg Arg Arg Leu Arg Ser 
500 



<210> 16 
<211> 502 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 16 

Met Lys Lys Lys Met He Gin Ser Leu Leu Val Ala Ser Leu Ala Phe 
1 5 . 10 15 

Gly Met Ala Val Ser Pro Val Thr Pro He Ala Phe Ala Ala Glu Thr 
20 25 30 

Gly Thr He Thr Val Gin Asp Thr Gin Lys Gly Ala Thr Tyr Lys Ala 
35 40 45 

Tyr Lys Val Phe Asp Ala Glu He Asp Asn Ala Asn Val Ser Asp Ser 
50 55 60 

Asn Lys Asp Gly Ala Ser Tyr Leu He Pro Gin Gly Lys Glu Ala Glu 
65 70 75 80 

Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 
85 90 95 

Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 
100 105 110 

He Ala Thr Trp Ala Lys Ser He Ser Ala Asn Thr Thr Pro Val Ser 
115 120 125 
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Thr Val Thr Glu Ser Asn Asn Asp Gly Thr Glu Val He Asn Val Ser 
130 135 

Gin Tyr Gly Tyr Tyr Tyr Val Ser Ser Thr Val Asn Asn Gly Ala Val 

150 155 160 

He Met Val Thr Ser Val Thr Pro Asn Ala Thr He His Glu Lys Asn 
165 170 175 

Thr Asp Ala Thr Trp Gly Asp Gly Gly Gly Lys Thr Val Asp Gin Lys 
180 185 190 

Thr Tyr Ser Val Gly Asp Thr Val Lys Tyr Thr He Thr Tyr Lys Asn 
195 200 205 

Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val He Lys 
210 215 220 

Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr> 
225 230 235 240 

Glu Val Thr He Thr Asp Gly Ser Gly Asn He Thr Thr Leu Thr Gin 
245 250 255 

Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 
260 265 270 

Asn Phe Thr He Thr He Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 
275 280 285 

Asn Thr Gin Asn Gly Ala Asn 'Asp Asp Phe Phe Tyr Lys Gly He Asn 
290 295 300 

Thr He Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 
305 310 315 320 

Gly Ser Ala Asp Leu Pro Glu Asn Thr Asn He Ala Thr He Asn Pro 
325 330 335 

Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 
340 345 ; 350 

Gin He Thr He Lys Lys He Asp Gly Ser Thr Lys Ala Ser Leu Gin 
355 360 365 

Gly Ala He Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 
370 375 380 
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Asn Asp Thr Asn 
385 

Tyr Thr Thr Gly 



Gly Thr Tyr Tyr 
420 

Leu Asp Asn Ser 
435 

Thr Asn Ser Asp 
450 

Gly Thr Glu Leu 
465 

He lie Gly Ala 



Arg Arg Arg Leu 
500 



Asn Val Glu Trp 
390 

Ala Asp Gly He 
405 

Leu Val Glu Lys 



Gin Lys Val lie 
440 

Asn Leu Leu Val 
455 

Pro Ser Thr Gly 
470 

lie Leu Val He 
485 

Arg Ser 



Gly Thr Glu Ala 
395 

He Thr He Thr 
410 

Lys Ala Pro Leu 
425 

Leu Gly Asp Gly 



Asn Pro Thr Val 
4 60 

Gly He Qly Thr 
475 

Gly Ala Gly He 
490 



Asn Ala Thr Glu 
400 

Gly Leu Lys Glu 
415 

Gly Tyr Asn Leu 
430 

Ala Thr Asp Thr 
445 

Glu Asn Asn Lys 



Thr He Phe Tyr 
480 

Val Leu Val Ala^ 
495 



<210> 17 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
<220> 

<223> X can be any amino acid 

<400> 17 ' 
Leu Pro Xaa Thr Gly 
1 5 



<210> 18 
<211> 1683 
<212> DNA 

<213> Streptococcus agalactiae 
<220> 
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<221> CDS 

<222> (1) . . (1683) 



<400> 18 

atg gtg ate gta ttc egg att ata cag ata tta caa ggg att ata tec 48 
Met Val lie Val Phe Arg lie lie Gin He Leu Gin Gly He He Ser 
15 10 15 



aag ate ctt cag gta cat att att ata agt atg att cac gag ata aag 96 
Lys He Leu Gin Val His He He He Ser Met He His Glu He Lys 
20 25 30 



ate ccg act caa eta aag atg cet att ata ega cag ata eta gtc tea 144 
He Pro Thr Gin Leu Lys Met Pro He He Arg Gin He Leu Val Ser 
35 40 45 



tea aat gtt gat aea aca act aag tac aag tac gta aaa gae get tac 192 

Ser Asn Val Asp Thr Thr Thr Lys Tyr Lys Tyr Val Lys Asp Ala Tyr 
50 55 60 

aaa tta gtc ggt tgg tat tat gtt aat eca tat ggt agt att aga cet 240 

Lys Leu Val Gly Trp Tyr Tyr Val Asn Pro Tyr Gly Ser He Arg Pro 

65 70 75 80 



tat aac ttt tea ggt get gta act caa gat ate aat tta aga get att 288 
Tyr Asn Phe Ser Gly Ala Val Thr Gin Asp He Asn Leu Arg Ala He 
85 90 95 



tgg cga aag get gga gat tat eat att ata tac age aat gat get gtt 336 
Trp Arg Lys Ala Gly Asp Tyr His He He Tyr Ser Asn Asp Ala Val 
100 105 110 

ggt aca gat gga aag eca gca ttg gat get tct ggt cag caa tta caa 384 
Gly Thr Asp Gly Lys Pro Ala Leu Asp Ala Ser Gly Gin Gin Leu Gin 
115 120 125 



aca agt aat gag cet act gae cet gat tec tat gae gat ggc tec cat 432 
Thr Ser Asn Glu Pro Thr Asp Pro Asp Ser Tyr Asp Asp Gly Ser His 
130 135 140 



tea gee tta ctg aga cgt ccg aea 
Ser Ala Leu Leu Arg Arg Pro Thr 
145 150 

ggc tgg tgg tac aat ggt aaa att 
Gly Trp Trp Tyr Asn Gly Lys He 
165 



atg eca gat ggc tat cgt ttc cgt 4 80 
Met Pro Asp Gly Tyr Arg Phe Arg 
155 160 

tat aac cea tat gat tec att gat 528 
Tyr Asn Pro Tyr Asp Ser He Asp 
170 175 
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att gac gcc cat tta gca gat get aat aaa aat ate acc ata aaa cct 576 
He Asp Ala His Leu Ala Asp Ala Asn Lys Asn He Thr He Lys Pro 
^80 2,90 

gtc att att cca gta gga gat ate aaa tta gaa gat acc tec ate aaa 624 
Val He He Pro Val Gly Asp He Lys Leu Glu Asp Thr Ser He Lys 
195 200 205 

tac aat ggt. aac ggt ggt act aga gta gaa aat ggt aat gtg gta aca 672 
Tyr Asn Gly Asn Gly Gly Thr Arg Val Glu Asn Gly Asn Val Val Thr 
210 215 220 

caa gtg gag aca ccg cgt atg gag ttg aat age aca act aca att cct 720 
Gin Val Glu Thr Pro Arg Met Glu Leu Asn Ser Thr Thr Thr He Pro 
225 230 235 240 

gaa aac caa tac ttt aca agg aca ggt tac aac ett att ggt tgg cat . 768 
Glu Asn Gin Tyr Phe Thr Arg Thr Gly Tyr Asn Leu He Gly Trp His 
245 250 255 

eat gat aag gat tta get gat aca gga cgt gtg gaa ttt aca gca ggt 816 
His Asp Lys Asp Leu Ala Asp Thr Gly Arg Val Glu Phe Thr Ala Gly 
260 265 270 

caa tea ata ggt att gat aac aac ett gat gca aca aat acc tta tat 8 64 
Gin Ser He Gly He Asp Asn Asn Leu Asp Ala Thr Asn Thr Leu Tyr 
275 280 285 

get gtt tgg caa cct aaa gaa tac acc gtc gga gta agt aaa act gtc 912 
Ala Val Trp Gin Pro Lys Glu Tyr Thr Val Gly Val Ser Lys Thr Val 
290 295 300 

gtt gga eta gat gaa gat aag acg aaa gac ttc ttg ttt aat cca agt 960 
Val Gly Leu Asp Glu Asp Lys Thr Lys Asp Phe Leu Phe Asn Pro Ser 
305 310 315 320 

gaa acg ttg caa caa gag aat ttt ccg ctg aga gat ggt cag act aag 1008 
Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly Gin Thr Lys 
325 330 335 

gaa ttt aaa gta cct tat gga act tet ata tea ata gat gaa caa gee 1056 
Glu Phe Lys Val Pro Tyr Gly Thr Ser He Ser He Asp Glu Gin Ala 
340 345 350 

tac gat gaa ttt aaa gta tet gag tea att aca gaa aaa aat eta gca 1104 
Tyr Asp Glu Phe Lys Val Ser Glu Ser He Thr Glu Lys Asn Leu Ala 
355 360 365 
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act ggt gaa get gat aaa act tat gat got acc ggc tta caa tec ctg 1152 
Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu Gin Ser Leu 
370 375 ' 

aca gtt tea gga gac gta gat att age ttt acc aat aca cgt ate aag 1200 
Thr Val Ser Gly Asp Val Asp He Ser Phe Thr Asn Thr Arg He Lys 
385 390 395 400 

caa aaa gta ega eta eag aaa gtt aat gtc gaa aat gat aat aat ttt 1248 
Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp Asn Asn Phe 
405 410 415 

tta gca ggt gca gtt ttt gat att tat gaa tea gat get aat ggg aat 1296 
Leu Ala Gly Ala Val Phe Asp He Tyr Glu Ser Asp Ala Asn Gly Asn 
420 425 430 

aaa get tea cat cct atg tat tea ggg ctg gtg aca aac gat aaa ggc 1344 
Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn Asp Lys Gly 
435 440 445 

ttg tta tta gtg gat get aat aac tac etc agt ttg cea gta gga aaa 1392 
Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro Val Gly Lys 
450 455 460 

tac tac eta aca gag aca aag gee cct cca ggg tac eta eta cct aaa 14 4 0 
Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu Leu Pro Lys 
465 470 475 480 

aat gat gat ata tea gta tta gtg att tet acg gga gtt acc ttt gaa 1488 
Asn Asp Asp He Ser Val Leu Val He Ser Thr Gly Val Thr Phe Glu 
485 490 495 

caa aat ggt aat aat gcg aca cca ata aaa gag aat tta gtg gat gga 1536 
Gin Asn Gly Asn Asn Ala Thr Pro He Lys Glu Asn Leu Val Asp Gly 
500 505 510 

agt aca gta tat act ttt aaa att act aac agt aaa gga aca gaa ttg 1584 
Ser Thr Val Tyr Thr Phe Lys He Thr Asn Ser Lys Gly Thr Glu Leu 
515 520 525 

cct agt act gga ggt att gga aca cae att tat ate eta gtt ggt tta 1632 
Pro Ser Thr Gly Gly He Gly Thr His He Tyr He Leu Val Gly Leu 
530 535 540 

get tta get eta cca tea gga tta ata tta tac tat ega aaa aaa ata 1680 
Ala Leu Ala Leu Pro" Ser Gly Leu He Leu Tyr Tyr Arg Lys Lys He 
545 550 555 560 
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tga 



<210> 19 

<211> '560 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 19 

Met Val He Val Phe Arg He He Gin He Leu Gin Gly He He Ser 
15 10 15 

Lys He Leu Gin Val His He He He Ser Met He His Glu He Lys 
20 25 30 

He Pro Thr Gin Leu Lys Met Pro He He Arg Gin He Leu Val Ser 
35 40 45 

Ser Asn Val Asp Thr Thr Thr Lys Tyr Lys Tyr Val Lys Asp Ala Tyi? 
50 55 60 

Lys Leu Val Gly Trp Tyr Tyr Val Asn Pro Tyr Gly Ser He Arg Pro 
65 70 75 80 

Tyr Asn Phe Ser Gly Ala Val Thr Gin Asp He Asn Leu Arg Ala He 
85 90 95 

Trp Arg Lys Ala Gly Asp Tyr His He He Tyr Ser Asn Asp Ala Val 
100 105 110 

Gly Thr Asp Gly Lys Pro Ala Leu Asp Ala Ser Gly Gin Gin Leu Gin 
115 120 125 

Thr Ser Asn Glu Pro Thr Asp Pro Asp Ser Tyr Asp Asp Gly Ser His 
130 135 140 

Ser Ala Leu Leu Arg Arg Pro Thr Met Pro Asp Gly Tyr Arg Phe Arg 
145 150 155 160 

Gly Trp Trp Tyr Asn Gly Lys He Tyr Asn Pro Tyr Asp Ser He Asp 
165 170 175 

He Asp Ala His Leu Ala Asp Ala Asn Lys Asn He Thr He Lys Pro 
180 185 190 

Val He He Pro Val Gly Asp He Lys Leu Glu Asp Thr Ser He Lys 
195 200 205 



1683 
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Tyr Asn Gly Asn Gly Giy Thr Arg Val Glu Asn Gly Asn Val Val Thr 
210 215 220 

Gin Val Glu Thr Pro Arg Met Glu Leu Asn Ser Thr Thr Thr lie Pro 
225 230 235 240 

Glu Asn Gin Tyr Phe Thr Arg Thr Gly Tyr Asn Leu lie Gly Trp His 
245 250 255 

His Asp Lys Asp Leu Ala Asp Thr Gly Arg Val Glu Phe Thr Ala Gly 
260 265 270 

Gin Ser lie Gly lie Asp Asn Asn Leu Asp Ala Thr Asn Thr Leu Tyr 
275 280 285 

Ala Val Trp Gin Pro Lys Glu Tyr Thr Val Gly Val Ser Lys Thr Val 
290 295 300 

Val Gly Leu Asp Glu Asp Lys Thr Lys Asp Phe Leu Phe Asn Pro Ser"* 
305 310 315 - 320 

Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly Gin Thr Lys 
325 330 335 

Glu Phe Lys Val Pro Tyr Gly Thr Ser lie Ser lie Asp Glu Gin Ala 
340 345 350 

Tyr Asp Glu Phe Lys Val Ser Glu Ser lie Thr Glu Lys Asn Leu Ala 
355 360 365 

Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu Gin Ser Leu 
370 375 380 

Thr Val Ser Gly Asp Val Asp lie Ser Phe Thr Asn Thr Arg lie Lys 
385 390 395 400 

Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp Asn Asn Phe 
405 410 415 

Leu Ala Gly Ala Val Phe Asp lie Tyr Glu Ser Asp Ala Asn Gly Asn 
420 425 430 

Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn Asp Lys Gly 
435 440 445 

Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro Val Gly Lys 
450 455 460 
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Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu Leu Pro Lys 
465 470 475 480 

Asn Asp Asp lie Ser Val Leu Val He Ser Thr Gly Val Thr Phe Glu 
485 490 495 

Gin Asn Gly Asn Asn Ala Thr. Pro He Lys Glu Asn Leu Val Asp Gly 
500 505 510 

Ser Thr Val Tyr Thr Phe Lys He Thr Asn Ser Lys Gly Thr Glu Leu 
515 520 525 

Pro Ser Thr Gly Gly He Gly Thr His He Tyr He Leu Val Gly Leu 
530 535 540 

Ala Leu Ala Leu Pro Ser Gly Leu He Leu Tyr Tyr Arg Lys Lys He 
545 550 555 560 



<210> 20 
<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
<400> 20 

Leu Pro Ser Thr Gly Gly 
1 5 



<210> 21 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus 
<220> 

<223> X can be any amino acid. 
<400> 21 

Xaa Pro Xaa Thr Gly Gly 
1 5 
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<210> 22 
<211> 2714 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 22 

caatcagaaa ttaccacgtg gcaatgttga 
ctctcttcaa ggggcaatgt tcaaagtcat 
tcttcaaaat ggtaaggaag tagttgtaac 
aggtctagag tatgggacat actatttatg 
attaacatcg cctgtttcct ttacaatcgg 
ggttaaaaat aacaagcgac cacggattga 
tatcttgatg cttgttgcca ttttgttgtt 
aaataactga tattcaatgt acatcattat 
gagtactctg aggtgatgtt aatcaggaat 
gatatgaggc tgggcagatt gtgccagcct 
gactggtctg gtaatcattt taggaatgga 
gtgaatcaga aagaaatgag attttctcgt 
aaaagcgata aaatgatgag tttgaagata 
aaaagcaaaa acgaaataat ctcctattag 
tggcgtatcc gctggtgtct cgcttgtatt 
actttgataa ggaaaaagca acgttggatg 
cacaagcctt caatgactct ttgaataatg 
tgaagaaaaa agggcgagca gagtatgcac 
atgtggaaat ccccgttatt gacgtggatt 
tattgcagca aggggctggg catctagagg 
cccatgcggt gattacggca catacaggtt 

28 



ctttatgaag gtggatggtc ggaccaatac 60 
gaaagaagaa agcggacact atactcctgt 120 
atcagggaaa gatggtcgtt tccgagtgga 180 
ggagctccaa gctccaactg gttatgttca 240 
gaaagatact cgtaaggaac tggtaacagt 300 
tgtgccagat acaggggaag aaaccttgta 360 
tggtagtggt tattatctta cgaaaaaabc 420 
gaaaaagata gcaggctgaa gggaagacca 480 
catggtgatg tggcatgaat cacaataacg 540 
cattgtgggt tattgtttgt aaaacgatag 600 
caggactggg attctgattt aaaatggatg 660 
ttctcttagc agataggatt gtctgttagg 720 
aagggatgct gataaaaatg gtaaaaacaa 780 
gagtggtatt tttcattgga atggcggtaa 8 40 
atcgagtgga atcaaatcaa caaattgctg 900 
aggctgacat tgatgaacga atgaaattgg 960 
tagtgagtgg cgatccttgg tcggaagaaa 1020 
gtatgttaga aatccatgag cggatggggc 1080 
tgccggttta tgctggtact gctgaagagg 1140 
gaacttctct gccgatcgga ggcaattcga 1200 
tgccaacagc taagatgttt acggatttga 1260 
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ccaaacttaa agttggggat aagttttatg tgcacaatat caaggaagtg atggcctatc 1320 
aagtggatca agtaaaggtg attgagccga cgaactttga tgatttattg attgtaccag 1380 
gtcatgatta tgtgaccttg ctgacttgta cgccatacat gatcaatacc catcgtctat 1440 
tggttcgggg gcatcggata ccgtacgtag cagaggttga ggaagaattt attgcagcaa 1500 
acaaactcag tcatctctat cgctacctgt tttatgtggc agttggtttg attgtgattc 15 60 
ttttatggat tattcgacgc ttgcgcaaga agaaaaaaca accggaaaag gctttgaagg 1620 
cgctgaaagc agcaaggaag gaagtgaagg tggaggatgg acaacagtag acgttcacga 1680 
aaaaaaggca caaaaaagaa gaaacatccg ctgatccttc ttctgatttt cttagtagga 1740 
ttcgccgttg cgatatatcc attggtgtct cgttattatt atcgtattga gtcaaacgag 1800 
gttattaaag agtttgatga gacggtttcc cagatggata aggcagaact tgaggagcgt 18 60 
tggcgcttgg ctcaagcctt caatgcgacc ttgaaaccat ctgaaattct tgatcctttt 1920 
acagagcaag agaaaaagaa aggcgtctca gaatatgcca atatgctaaa ggtccatgag 1980 
cggattggct atgtggaaat tcctgcgatt gatcaggaaa ttccgatgta tgtcggaacg 2040 
agtgaggaca ttcttcagaa aggggcaggg ctgttagaag gggcttcgct gcctgttgga 2100 
ggtgaaaata cccatacagt gatcactgct cacagaggat tgccaacggc agaattgttc 2160 
agtcaattgg ataagatgaa aaaaggggat atcttttatc ttcacgtttt agatcaggtg 2220 
ttggcctacc aagtggatca gatagtgacg gtggagccga atgactttga gcctgtcttg 2280 
attcaacatg gggaagatta tgcgaccttg ttgacttgta caccgtatat gattaacagt 2340 
catcgtctgt tggtacgtgg gaagcggatt ccgtatacgg caccaattgc agagcggaat 2400 
cgagcggtga gagagcgtgg gcaattctgg ttgtggttat tactaggagc gatggcggtc 24 60 
atccttctct tgctgtatcg cgtgtatcgt aatcgacgga ttgtcaaagg actagaaaag 2520 
caattggagg ggcgtcatgt caaggactaa actacgagcc ttattgggat acttgttgat 2580 
gttggtagcc tgtttgattc ctatttattg ttttggacag atggtgttgc agtctcttgg 2 64 0 
acaggtgaaa ggtcatgcta catttgtgaa atccatgaca actgaaatgt accaagaaca 2700 
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acagaaccat tctc 2714 



<210> 23 
<211> 297 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 23 

Met Asp Asn Ser Arg Arg Ser Arg Lys Lys Gly Thr Lys Lys Lys Lys 
15 10 15 

His Pro Leu lie Leu Leu Leu lie Phe Leu Val Gly Phe Ala Val Ala 
20 25 30 

lie Tyr Pro Leu Val Ser Arg Tyr Tyr Tyr Arg lie Glu Ser Asn Glu 
35 40 ' 45 

Val lie Lys Glu Phe Asp Glu Thr Val Ser Gin Met Asp Lys Ala Glu^* 
50 55 60 . - 

Leu Glu Glu Arg Trp Arg Leu Ala Gin Ala Phe Asn Ala Thr Leu Lys 
65 70 75 80 

Pro Ser Glu lie Leu Asp Pro Phe Thr Glu Gin Glu Lys Lys Lys Gly 
85 ' 90 95 

Val Ser Glu Tyr Ala Asn Met Leu Lys Val His Glu Arg lie Gly Tyr 
100 105 110 

Val Glu lie Pro Ala lie Asp Gin Glu lie Pro Met Tyr Val Gly Thr 
115 120 125 

Ser Glu Asp lie Leu Gin Lys Gly Ala Gly Leu Leu Glu Gly Ala Ser 
130 135 140 

Leu Pro Val Gly Gly Glu Asn Thr His Thr Val lie Thr Ala His Arg 
145 150 155 160 

Gly Leu Pro Thr Ala Glu Leu Phe Ser Gin Leu Asp Lys Met Lys Lys 
165 170 175 

Gly Asp lie Phe Tyr Leu His Val Leu Asp Gin Val Leu Ala Tyr Gin 
180 185 190 

Val Asp Gin lie Val Thr Val Glu Pro Asn Asp Phe Glu Pro Val Leu 
195 200 • 205 
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lie Gin His Gly Glu Asp Tyr Ala Thr Leu Leu Thr Cys Thr Pro Tyr 
210 215 220 

Met lie Asn Ser His Arg Leu Leu Val Arg Gly Lys Arg lie Pro Tyr 
225 230 235 240 

Thr Ala Pro lie Ala Glu Arg Asn Arg Ala Val Arg Glu Arg Gly Gin 
245 250 255 

Phe Trp Leu Trp Leu Leu Leu Gly Ala Met Ala Val lie Leu Leu Leu 
260 265 270 

Leu Tyr Arg Val Tyr Arg Asn Arg Arg lie Val Lys Gly Leu Glu Lys 
275 280 285 

Gin Leu Glu Gly Arg His Val Lys Asp 
290 295 



<210> 24 
<211> 894 

<212> DNA 

<213> Streptococcus pneumoniae 
<400> 24 

atggacaaca gtagacgttc acgaaaaaaa ggcacaaaaa agaagaaaca tccgctgatc 60 
cttcttctga ttttcttagt aggattcgcc gttgcgatat atccattggt gtctcgttat 120 
tattatcgta ttgagtcaaa cgaggttatt aaagagtttg atgagacggt ttcccagatg 180 
gataaggcag aacttgagga gcgttggcgc ttggctcaag ccttcaatgc gaccttgaaa 240 
ccatctgaaa ttcttgatcc ttttacagag caagagaaaa agaaaggcgt ctcagaatat 300 
gccaatatgc taaaggtcca tgagcggatt ggctatgtgg aaattcctgc gattgatcag 360 
gaaattccga tgtatgtcgg aacgagtgag gacattcttc agaaaggggc agggctgtta 420 
gaaggggctt ' cgctgcctgt tggaggtgaa aatacccata cagtgatcac tgctcacaga 480 
ggattgccaa cggcagaatt gttcagtcaa ttggataaga tgaaaaaagg ggatatcttt 54 0 
tatcttcacg ttttagatca ggtgttggcc taccaagtgg atcagatagt gacggtggag 600 
ccgaatgact ttgagcctgt cttgattcaa catggggaag attatgcgac cttgttgact 660 
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tgtacaccgt atatgattaa cagtcatcgt ctgttggtac gtgggaagcg gattccgtat 720 
acggcaccaa ttgcagagcg gaatcgagcg gtgagagagc gtgggcaatt ctggttgtgg 780 
ttattactag gagcgatggc ggtcatcctt ctcttgctgt atcgcgtgta tcgtaatcga 840 
cggattgtca aaggactaga aaagcaattg gaggggcgtc atgtcaagga ctaa 894 

<210> 25 
<211> 3010 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 25 

tgttaggaaa agcgataaaa tgatgagttt gaagataaag ggatgctgat aaaaatggta 60 

aaaacaaaaa agcaaaaacg aaataatctc ctattaggag tggtattttt cattggaatg 120 

gcggtaatgg cgtatccgct ggtgtctcgc ttgtattatc gagtggaatc aaatcaadaa 180 

attgctgact ttgataagga aaaagcaacg ttggatgagg ctgacattga tgaacgaatg 240 

aaattggcac aagccttcaa tgactctttg aataatgtag tgagtggcga tccttggtcg 300 

gaagaaatga agaaaaaagg gcgagcagag tatgcacgta tgttagaaat ccatgagcgg 360 

atggggcatg tggaaatccc cgttattgac gtggatttgc cggtttatgc tggtactgct 420 

gaagaggtat tgcagcaagg ggctgggcat ctagagggaa cttctctgcc gatcggaggc 480 

aattcgaccc atgcggtgat tacggcacat acaggtttgc caacagctaa gatgtttacg 540 

gatttgacca aacttaaagt tggggataag ttttatgtgc acaatatcaa ggaagtgatg 600 

gcctatcaag tggatcaagt aaaggtgatt gagccgacga actttgatga tttattgatt 660 

gtaccaggtc atgattatgt gaccttgctg acttgtacgc catacatgat caatacccat 720 

cgtctattgg ttcgggggca tcggataccg tacgtagcag aggttgagga agaatttatt 780 

gcagcaaaca aactcagtca tctctatcgc tacctgtttt atgtggcagt tggtttgatt 8 40 

gtgattcttt tatggattat tcgacgcttg cgcaagaaga 'aaaaacaacc ggaaaaggct 900 

ttgaaggcgc tgaaagcagc aaggaaggaa gtgaaggtgg aggatggaca acagtagacg 960 

ttcacgaaaa aaaggcacaa aaaagaagaa acatccgctg atccttcttc tgattttctt 1020 
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agtaggattc gccgttgcga tatatccatt 
aaacgaggtt attaaagagt ttgatgagac 
ggagcgttgg cgcttggctc aagccttcaa 
tccttttaca gagcaagaga aaaagaaagg 
ccatgagcgg attggctatg tggaaattcc 
cggaacgagt gaggacattc ttcagaaagg 
tgttggaggt gaaaataccc atacagtgat 
attgttcagt caattggata agatgaaaaa 
tcaggtgttg gcctaccaag tggatcagat 
tgtcttgatt caacatgggg aagattatgc 
taacagtcat cgtctgttgg tacgtgggaa 
gcggaatcga gcggtgagag agcgtgggca 
ggcggtcatc cttctcttgc tgtatcgcgt 
agaaaagcaa ttggaggggc gtcatgtcaa 
tgttgatgtt ggtagcctgt ttgattccta 
ctcttggaca ggtgaaaggt catgctacat 
aagaacaaca gaaccattct ctcgcctaca 
tagatccttt tttggcggag ggatatgagg 
cagtctatgg ttacttgtct attccaagtt 
cagattatca tcatttaggg atgggcttgg 
atggtacagg gattcgctca gtgattgctg 
tccgccattt ggatcagcta aaagttggag 
ttgtagaata tcagatgatg gacacagaga 
aatcggttag ctctaaaaat atcatgacct 



ggtgtctcgt tattattatc gtattgagtc 1080 
ggtttcccag atggataagg cagaacttga 1140 
tgcgaccttg aaaccatctg aaattcttga 1200 
cgtctcagaa tatgccaata tgctaaaggt 1260 
tgcgattgat caggaaattc cgatgtatgt 1320 
ggcagggctg ttagaagggg cttcgctgcc 1380 
cactgctcac agaggattgc caacggcaga 1440 
aggggatatc ttttatcttc acgttttaga 1500 
agtgacggtg gagccgaatg actttgagcc 1560 
gaccttgttg acttgtacac cgtatatgdt 1620 
gcggattccg tatacggcac caattgcaga 1680 
attctggttg tggttattac taggagcgat 17 4 0 
gtatcgtaat cgacggattg tcaaaggact 1800 
ggactaaact acgagcctta ttgggatact 18 60 
tttattgttt tggacagatg gtgttgcagt 1920 
ttgtgaaatc catgacaact gaaatgtacc 1980 
atcaacgctt ggcttcgcaa aatcgcattg 2040 
tcaattacca agtgtctgac gaccctgatg 2100 
tggaaatcat ggagccggtt tatttgggag 2160 
ctcatgtgga tggtacaccg ctgcctctgg 2220 
ggcaccgtgc agagccaagc catgtctttt 2280 
atgctcttta ttatgataat ggccaggaaa 2340 
ttattttacc gtcggaatgg gaaaaattag 2400 
tgataacctg cgatccgatt cctaccttta 24 60 
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ataaacgctt 


attagtgaat 


tttgaacgag 


tcgctgttta 


tcaaaaatca 


gatccacaaa 


2520 


cagctgcagt 


tgcgagggtt 


gcttttacga 


aagaaggaca 


atctgtatcg 


cgtgttgcaa 


2580 


cctctcaatg 


gttgtaccgt 


gggctagtgg 


tactggcatt 


tctgggaatc 


ctgtttgttt 


2640 


tgtggaagct 


agcacgttta 


ctacgaggga 


aataaaaaga 


aatgaaagga 


aagctaaggc 


2700 


tgttcctttt 


tccggctctt 


tgtcaactgt 


agtgggttga 


aaaaaagcta 


agctcgagaa 


2760 


aggacaaatt 


ttgtcctttc 


ttttttgata 


ttcagagcga 


taaaaatccg 


ttttttgaag 


2820 


ttttcaaagt 


ttcgaaaacc 


aaaggcattg 


cgcttgataa 


gtttgatgag 


attattggtc 


2880 


gcttccagtt 


tggcattaga 


atagtgtagt 


tgaagggcgt 


tgataacctt 


ttctttatct 


2940 


ttgaggaagg 


ttttaaagac 


agtctgaaaa 


ataggatgaa 


cctgcttaag 


attgtcctcg 


3000 


ataagttcga 












3010 



<210> 26 

<211> 304 
<212> PflT 

<213> Streptococcus pneumoniae 

<400> 26 

Met Leu He Lys Met Val Lys Thr Lys Lys Gin Lys Arg Asn Asn Leu 
15 10 15 

Leu Leu Gly Val Val Phe Phe He Gly Met Ala Val Met Ala Tyr Pro 
20 25 30 

Leu Val Ser Arg Leu Tyr Tyr Arg Val Glu Ser Asn Gin Gin He Ala 
35 40 45 

Asp Phe Asp Lys Glu Lys Ala Thr Leu Asp Glu Ala Asp He Asp Glu 
50 55 60 

Arg Met Lys Leu Ala Gin Ala Phe Asn Asp Ser Leu Asn Asn Val Val 
65 70 75 80 

Ser Gly Asp Pro Trp Ser Glu Glu Met Lys Lys Lys Gly Arg Ala Glu 
85 90 95 

Tyr Ala Arg Met Leu Glu He His Glu Arg Met Gly His Val Glu He 
100 105 110 
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Pro Val He Asp Vai Asp Leu Pro Val Tyr Ala Gly Thr Ala Glu Glu 
115 120 125 

Val Leu Gin Gin Gly Ala- Gly His Leu Glu Gly Thr Ser Leu Pro He 
130 135 140 

Gly Gly Asn Ser Thr His Ala Val He Thr Ala His Thr Gly Leu Pro 
145 150 155 160 

Thr Ala Lys Met Phe Thr Asp Leu Thr Lys Leu Lys Val Gly Asp Lys 
165 170 175 

Phe Tyr Val His Asn He Lys Glu Val Met Ala Tyr Gin Val Asp Gin 
180 185 190 

Val Lys Val He Glu Pro Thr Asn Phe Asp Asp Leu Leu He Val Pro 
195 200 '205 

Gly His Asp Tyr Val Thr Leu Leu Thr Cys Thr Pro Tyr Met He Asn' 
210 215 220 

Thr His Arg Leu Leu Val Arg Gly His Arg He Pro Tyr Val Ala Glu 
225 230 235 240 

Val' Glu Glu Glu Phe He Ala Ala Asn Lys Leu Ser His Leu Tyr Arg 
245 250 255 

Tyr Leu Phe Tyr Val Ala Val Gly Leu He Val He Leu Leu Trp He 
260 265 270 

He Arg Arg Leu Arg Lys Lys Lys Lys Gin Pro Glu Lys Ala Leu Lys 
275 280 285 

Ala Leu Lys Ala Ala Arg Lys Glu Val Lys Val Glu Asp Gly Gin Gin 
290 295 300 



<210> 27 
<211> 915 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 27 

atgctgataa aaatggtaaa aacaaaaaag caaaaacgaa ataatctcct attaggagtg 60 
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gtatttttca ttggaatggc ggtaatggcg 
gtggaatcaa atcaacaaat tgctgacttt 
gacattgatg aacgaatgaa attggcacaa 
agtggcgatc cttggtcgga agaaatgaag 
ttagaaatcG atgagcggat ggggcatgtg 
gtttatgctg gtactgctga agaggtattg 
tctctgccga tcggaggcaa ttcgacccat 
acagctaaga tgtttacgga tttgaccaaa 
aatatcaagg aagtgatggc ctatcaagtg 
tttgatgatt tattgattgt accaggtcat 
tacatgatca atacccatcg tctattggtt 
gttgaggaag aatttattgc agcaaacaaa 
gtggcagttg gtttgattgt gattctttta 
aaacaaccgg aaaaggcttt gaaggcgctg 
gatggacaac agtag 

<210> 28 
<211> 2199 
<212> DNA 

<213> Enterococcus faecalis 
<400> 28 

actaaaattc gtttacttta tgcatttaaa 
aaatgaggcg aatgttgata acggtcatac 
tgtgacaggt gggaaacgtt tcattaaagt 
ggcgggagct tcctttgtcg tccgtgatca 
cgatgaaaca acgaaagcag caacttgggt 



tatccgctgg tgtctcgctt gtattatcga 120 
gataaggaaa aagcaacgtt ggatgaggct 180 
gccttcaatg actctttgaa taatgtagtg 240 
aaaaaagggc gagcagagta tgcacgtatg 300 
gaaatccccg ttattgacgt ggatttgccg 360 
cagcaagggg ctgggcatct agagggaact 420 
gcggtgatta cggcacatac aggtttgcca 48 0 
cttaaagttg gggataagtt ttatgtgcac 54 0 
gatcaagtaa aggtgattga gccgacgaac 600 
gattatgtga ccttgctgac ttgtacgcea 660 
cgggggcatc ggataccgta cgtagcagag 720 
ctcagtcatc tctatcgcta cctgttttat 780 
tggattattc gacgcttgcg caagaagaaa 840 
aaagcagcaa ggaaggaagt gaaggtggag 90 0 

915 



tgaaaaagca gatcctacga aaggctttaa 60 
cgacgaccaa acaccaccaa ctgttgaagt IZO 
cgatggcgat gtgacagcga cacaagcctt 180 
aaacagcgac acagcaaatt atttgaaaat 24 0 
gaaaacaaaa gctgaagcaa ctacttttac 300 
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aacaacggct gatggattag ttgatatcac 
agaaactgta gctcctgatg attatgtctt 
tgaacaatca tatggcacaa cagaaaacct 
caaaggtacc ttaccttcaa caggtggcaa 
agtcttgcta cttattgcag gagtctactt 
tctagcatca ccgaagaaat ttttagaaaa 
gctctcatgc tttattttta aggaggaagc 
tgatggtttt atgattcttt tactgattat 
tagcgatgca ttaaataact atctggatca 
aagccaagaa aacaccaaag aaatggctga 
agaattagcg aaaaaaggca gcaatcctgg 
aacgaaaaaa ccagacaaat cctattttga 
aaaaataaat gtccgtttac caatttttga 
aagctccttg ttagaaggaa cctcctatcc 
ttcaggccat cgtggtctcc ctcaagccaa 
aggcgatgaa ttttatatcg aagtcaatgg 
aaaaaccgtt gaaccaactg atacaaaaga 
cactttatta acttgcacac cgtatatgat 
tcgtatccca tatcaaccag aaaaagcagc 
aaatttacta ttatggacat tacttttaat 
tatctggtac aagcgacgga aaaagacgac 
taaacatact aaaaaaaaga gtaaaaaaat 
gcataattga accagagaaa cagaagtatt 
taaaaagcga caaagggcca atcaatcgac 
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agggcttaaa tacggtacct attatttaga 360 
gttaacaaat cggattgaat ttgtggtcaa 420 
agttt caeca gaaaaagtac caaacaaaca 4 80 
aggaatctac gtttacttag gaagtggcgc 540 
tgctagacgt agaaaagaaa atgcttaatt 600 
acaaagagcc tgggccaatc actgtcccag 660 
aatgaagtca aaaaagaaac gtcgtatcat 720 
tggaataggt gcatttgcgt atccttttgt 780 
acaaattatc gctcattatc aagcaaaagc 840 
acttcaagaa aaaatggaaa agaaaaacca 900 
attagatcct ttttctgaaa^cgcaaaaaac 960 
aagtcatacg attggtgttt taaccattcc 1020 
taaaacgaat gcattgctat tggaaaaagg 1080 
tacaggtggt acgaatacac atgcggtcat 114 0 
attatttaca gatttgccag aattaaaaaa 1200 
gaagacgctt gcttatcaag tagatcaaat 12 60 
tttacacatt gagtctggcc aagatctcgt 1320 
aaacagtcat cggttattag ttcgaggaca 1380 
agcggggatg aaaaaagtgg cacaacaaca 14 4 0 
tgcctgtgcg ttaattatta gcggcttcat 1500 
cagaaaacca aagtagtatg acgaaaaggc 15 60 
agcttttcaa tttttaatcc tccttatcgt 1620 
aacgaaataa ctaaaagagc aagccctgaa 1680 
tgtttaaatt cctgccaagt ttggattttt 17 4 0 
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ctgttttttt tcgcgctatc ctcaagcgtg agtaaataat tcaatagtaa gaggagtagc 1800 
aacaccgtga aatcatttgt ggtaaaaagc acatgtaaaa atagaatgac aaagacaaca 1860 
cgggataaca ctcgattccg caaaattaaa aataacttag cacgcataat aaaccaccat 1920 • 
ttcttatcag agataatgaa tctgtttttg tctactcttt agttatatca taaaattctt 1980 
aataatgaaa a'aatgactcg agaaaataat tgaaaaaagt tttttttcct gaatcattat 2040 
tttcgtaaat aaagaataaa cgtgttactc ttggcttatc aaatttggaa ggagtgttaa 2100 
aaatgaaata tctggatatt attgctttaa ttttattgat tgtcggaggt ttaaactggt 2160 
tattagttgg tgcatttaat tttgatttag ttgcaacaa 2199 



<210> 29 
<211> 284 
<212> PRT 

<213> Enterococcus faecalis 
<400> 29 

Met Lys Ser Lys Lys Lys Arg Arg He He Asp Gly Phe Met He Leu 
15 10 15 

Leu Leu He He Gly He Gly Ala Phe Ala Tyr Pro Phe Val Ser Asp - 
20 25 30 

Ala Leu Asn Asn Tyr Leu Asp Gin Gin He He Ala His Tyr Gin Ala 
35 40 45 

Lys Ala Ser Gin Glu Asn Thr Lys Glu Met Ala Glu Leu Gin Glu Lys 
50 55 60 

Met Glu Lys Lys Asn Gin Glu Leu Ala Lys Lys Gly Ser Asn Pro Gly 
65 70 75 80 

Leu Asp Pro Phe Ser Glu Thr Gin Lys Thr Thr Lys Lys Pro Asp Lys 
85 90 95 

Ser Tyr Phe Glu Ser His Thr He Gly Val Leu Thr He Pro Lys He 
100 105 110 

Asn Val Arg Leu Pro He Phe Asp Lys Thr Asn Ala Leu Leu Leu Glu 
115 120 , 125 

Lys Gly Ser Ser Leu Leu Glu Gly Thr Ser Tyr Pro Thr Gly Gly Thr 

38 



wo 02/12294 PCT/USOl/24795 

130 135 140 

Asn Thr His Ala Val lie Ser Gly His Arg Gly Leu Pro Gin Ala Lys 
145 150 155 160 

Leu Phe Thr Asp Leu Pro Glu Leu Lys Lys Gly Asp Glu Phe Tyr He 
165 170 175 

Glu Val Asn Gly Lys Thr Leu Ala Tyr Gin Val Asp Gin He Lys Thr 
180 185 190 

Val Glu Pro Thr Asp Thr Lys Asp Leu His He Glu Ser Gly Gin Asp 
195 200 205 

Leu Val Thr Leu Leu Thr Cys Thr Pro Tyr Met He Asn Ser His Arg 
210 215 220 

Leu Leu Val Arg Gly His Arg He Pro Tyr Gin Pro Glu Lys Ala Ala 
225 230 235 240 

Ala Gly Met Lys Lys Val Ala Gin Gin Gin Asn Leu Leu Leu Trp Thr 
245 250 255 

Leu Leu Leu He Ala Cys Ala Leu He He Ser Gly Phe He He Trp 
260 265 270 

Tyr Lys Arg Arg Lys Lys Thr Thr Arg Lys Pro Lys 
275 280 

<210> 30 
<211> 855 
<212> DNA 

<213> Enterococcus faecalis 
<400> 30 

atgaagtcaa aaaagaaacg tcgtatcatt gatggtttta tgattctttt actgattatt 60 
ggaataggtg catttgcgta tccttttgtt agcgatgcat taaataacta tctggatcaa 120 
caaattatcg ctcattatca agcaaaagca agccaagaaa acaccaaaga aatggctgaa 180 
cttcaagaaa aaatggaaaa gaaaaaccaa gaattagcga aaaaaggcag caatcctgga 240 
ttagatcctt tttctgaaac gcaaaaaaca acgaaaaaac cagacaaatc ctattttgaa 300 
agtcatacga ttggtgtttt aaccattcca aaaataaatg tccgtttacc aatttttgat 360 
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aaaacgaatg cattgctatt ggaaaaagga agctccttgt tagaaggaac ctcctatcct 420 
acaggtggta cgaatacaca tgcggtcatt tcaggccatc gtggtctccc tcaagccaaa 480 
ttatttacag atttgccaga attaaaaaaa ggcgatgaat tttatatcga agtcaatggg 540 
aagacgcttg cttatcaagt agatcaaata aaaaccgttg aaccaactga tacaaaagat 600 
ttacacattg agtctggcca agatctcgtc actttattaa cttgcacacc gtatatgata 660 
aacagtcatc ggttattagt tcgaggacat cgtatcccat atcaaccaga aaaagcagca 720 
gcggggatga aaaaagtggc acaacaacaa aatttactat tatggacatt acttttaatt 780 
gcctgtgcgt taattattag cggcttcatt atctggtaca agcgacggaa aaagacgacc 840 
agaaaaccaa agtag 855 

<210> 31 
<211> 2687 
<212> DNA 

<213> Corynebacterium diphtheriae 
<400> 31 . 

gtggtccgga gtatgacaag aacgctccgg ttcaggtaaa cggcactggt aacggtaacg 60 
atctcgtggt cacctctgac aagaacggca acgtccactt cgagggcctg ttcgtctccg 120 
acgaccagaa tgatccggga aagtcagctg cgcagcgctg ctacgtcctc gtcgagaccg 18 0 
aggccccgac gggcttcgtt actccgaaag atgggacggt cttcccagtt gctgtaaaga 240 
ttggacagac tgctaccact acctacgacg caaaggtcga gaacgtcaag cgcgataccc 300 
ctgacctgcc gctgaccggt ggcaagggtg tgctgttcct gatgattgcc ggtggtctgt 360 
tgctgctggt tgctgttggt gctggtttcg tctttgtacg ccgtatcaac gagtaattga 420 
tttgtcgcgt gattaaataa tcgcgttgcg ccgcccaatg cagggcatca aatgccccgc 480 
cggcgggcat aaacgccggc ggggtgcggt ggctttccac cgcaccccca cattctttgt 54 0 
cagagatttg ctgtttggcc tgtgccaccc ggcatccccc tatatgagaa acggacgtac 600 
ctgtcatggt taccaccgcg tcaccgcgct ctaccggacc ggataaccca gacgcgcaac 660 
caaagcgtcg ttgggtcttt tccggactcg cattgtttgc gtgtataacg gcgctagccg 720 
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gcctcatgtt ggggttgtat ccatctactg 
aactggtaga tctctatgat tccaaaattg 
tacttgaact cgcgcaccgt tataacgacc 
gggctaacgt cccccgcgga gcgggcaaag 
agttgcgtgt tgaccgtacg gatgtcatgg 
atctaccgat ctatcacggc acgagcgata 
aaggtacctc gttaccggtg ggaggaccac 
gcttagctga ggccaccatg ttcactaatc 
ccattgaggt gatgggcgaa gtccttacgt 
cagaggacac taggttcctg caaactcaag 
gtactccgtt gggcatcaat acacatcgca 
ccccgcaatc cgatatcgat gcagcacgtc 
cggtcatttt cgcagtggga tttagcttta 
tgattcctcc aaagaagaag gaagaagaca 
gaaacggcgg ggaaggaacc caacgtggtc 
agttccaatg cgggtaccgg tgagagctca 
gcgatagtca tgcttttgtg cggactgtta 
tccactcaac ttaacaatta tgaacagtct 
gcccaagctg accctgccgt agttgctgct 
tcgctagaaa atggacccct gcaggatccg 
gcctatcagg catacgagaa actcttaggg 
ccggctattt ccgtgaacct tcccatttac 
ggtgttgggc acctttacgg tactgcgcta 
ctaacagcgc attcaggtat ccaaaaatcg 
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cagcgtggtt taacgcccgc gaacaggcca 780 
aaaatgcaac ccctcttagc gcggaacaat 840 
gcctgaccgt aggcgctgct ctcgatccct 900 
aagacggcga cggtatggcc tataaagacc 960 
ctcgtatacg tatcccctct atcaaggtgg 1020 
acactctaaa gaagggcgct ggccatttgg 1080 
gcacccattc cgttatcact gcacaccgtg 1140 
tcaacaaggt tggggtaggg gatagattca 1200 
atgaagtgcg tgaaactcgt gtggtcagcc 1260 
acgatcgtga ccttgtcaca ctcgttactt 1320 
ttctggtgac agctgagcgc attactccca 1380 
aagcttccca aatcggcttc ccttggtggg 1440 
tcgccttgtt cttctggcgt tcgggttaca 1500 
tcgaaagcga agctgatggc gatgaactct 15 60 
agtaccgacg ctaaacactc caccggtacc 1620 
gcgaaaaaga aagcgcagac ggcaattgct 1680 
gggctggtga ttctgttcta tccagtcgtg 1740 
aaactcgccc gacagtttgg tgcagacgct 1800 
gctcttgatg ctgcccatgc ctacaacgat 1860 
tggaccggtg gagatagcac taaggatcct 1920 
gaatatccgg cgatggctca gatctctatc 1980 
cacgggacaa gcgacgccac actcctcaaa 204 0 
cccgttggtg gactggggac gcgttcggtt 2100 
accttctttg acaatttaga aaaggtcaaa 2160 
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aagggtgacg ccatttatgt acgcaatatt ggtgagaccc tgaaatacca agtacgcgac 2220 

atcgaaatca tccgtccagc ggagattgac cgtatccagc caatcccaga ccgagactta 2280 

attaccctcg tgacctgtac accctatgga atcaataccc ataggctttt ggttactgcc 2340 

gaacgtgtcc ctatggaacc cggtgaggcg gaccgtgcat ttgccggtga cggaattgtc 24 00 

tggcagtggt ggatgaagct agctatcggt gtgttggtgg tcatccttct cctaactggg 24 60 

tggctcatta tccgtatttt gcgagctagg aaattcgcga agaaaacagc tggagcagac 2520 

gctgctaaat ctgttgaacc tggtgat^tt gaggcgtcgc taagcgcttc agcggccgag 258 0 

gagtcccagt aatatgcaga aaccaatttc cccaacacat gcaaacaccc aagcagtcgc 2640 

ccattcctga aaggacgccc tactatgaag aagactcact tgttccg 2687 



<210> 32 
<211> 348 
<212> PRT 

<213> Corynebacteriioiu diphtheriae 
<400> 32 

Met Ala Met Asn Ser Glu Thr Ala Gly Lys Glu Pro Asn Val Val Ser 
15 10 15 

Thr Asp Ala Lys His Ser Thr Gly Thr Ser Ser Asn Ala Gly Thr Gly 
20 25 30 

Glu Ser Ser Ala Lys Lys Lys Ala Gin Thr Ala lie Ala Ala lie Val 
35 40 45 

Met Leu Leu Cys Gly Leu Leu Gly Leu Val lie Leu Phe Tyr Pro Val 
50 55 60 

Val Ser Thr Gin Leu Asn Asn Tyr Glu Gin Ser Lys Leu Ala Arg Gin 
65 70 75 80 

Phe Gly Ala Asp Ala Ala Gin Ala Asp Pro Ala Val Val Ala Ala Ala 
85 90 95 

Leu Asp Ala Ala His Ala Tyr Asn Asp Ser Leu Glu Asn Gly Pro Leu 
100 105 110 

Gin Asp Pro Trp Thr Gly Gly Asp Ser Thr Lys Asp Pro Ala Tyr Gin 
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115 120 125 

Ala Tyr Glu Lys Leu Leu Gly Glu Tyr Pro Ala Met Ala Gin lie Ser 
130 135 140 

lie Pro Ala lie Ser Val Asn Leu Pro lie Tyr His Gly Thr Ser Asp 
145 150 155 160 

Ala Thr Leu "Leu Lys Gly Val Gly His Leu Tyr Gly Thr Ala Leu Pro 
165 170 175 

Val Gly Gly Leu Gly Thr Arg Ser Val Leu Thr Ala His Ser Gly He 
180 185 190 

Gin Lys Ser Thr Phe Phe Asp Asn Leu Glu Lys Val Lys Lys Gly Asp 
195 200 205 

Ala He Tyr Val Arg Asn He Gly Glu Thr Leu Lys Tyr Gin Val Arg 
210 215 220 

J^ 

Asp He Glu He He Arg Pro Ala Glu He Asp Arg He Gin Pro He 
225 230 235 240 

Pro Asp Arg Asp Leu He Thr Leu Val Thr Cys Thr Pro Tyr Gly He 
245 250 255 

Asn Thr His Arg Leu Leu Val Thr Ala Glu Arg Val Pro Met Glu Pro 
260 265 270 

Gly Glu Ala Asp Arg Ala Phe Ala Gly Asp Gly He Val Trp Gin Trp 
275 280 ^ 285 

Trp Met Lys Leu Ala He Gly Val Leu Val Val -He Leu Leu Leu Thr 
290 295 300 

Gly Trp Leu He He Arg He Leu Arg Ala Arg Lys Phe Ala Lys Lys 
305 310 315 320 

Thr Ala Gly Ala Asp Ala Ala Lys Ser Val Glu Pro Gly Asp He Glu 
325 330 335 

Ala Ser Leu Ser Ala Ser Ala Ala Glu Glu Ser Gin 
340 345 



<210> 33 

<211> 1047 
<212> DNA 
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<213> Corynebacterium diphtherias 
<400> 33 

atggcgatga actctgaaac ggcggggaag gaacccaacg tggtcagtac cgacgctaaa 60 
cactccaccg gtaccagttc caatgcgggt accggtgaga gctcagcgaa aaagaaagcg 120 
cagacggcaa ttgctgcgat agtcatgctt ttgtgcggac tgttagggct ggtgattctg 180 
ttctatccag tcgtgtccac tcaacttaac aattatgaac agtctaaact cgcccgacag 240 
tttggtgcag acgctgccca agctgaccct gccgtagttg ctgctgctct tgatgctgcc 300 
catgcctaca acgattcgct agaaaatgga cccctgcagg atccgtggac cggtggagat 360 
agcactaagg atcctgccta tcaggcatac gagaaactct taggggaata tccggcgatg 420 
gctcagatct ctatcccggc tatttccgtg aaccttccca tttaccacgg gacaagcgac 480 
gccacactcc tcaaaggtgt tgggcacctt tacggtactg cgctacccgt tggtggact*g 540 
gggacgcgtt cggttctaac agcgcattca ggtatccaaa aatcgacctt ctttgacaat 600 
ttagaaaagg tcaaaaaggg tgacgccatt tatgtacgca atattggtga gaccctgaaa 660 
taccaagtac gcgacatcga aatcatccgt ccagcggaga ttgaccgtat ccagccaatc 720 
ccagaccgag acttaattac cctcgtgacc tgtacaccct atggaatcaa tacccatagg 7 80 
cttttggtta ctgccgaacg tgtccctatg gaacccggtg aggcggaccg tgcatttgcc 84 0 
ggtgacggaa ttgtctggca gtggtggatg aagctagcta tcggtgtgtt ggtggtcatc 900 
cttctcctaa ctgggtggct cattatccgt attttgcgag ctaggaaatt cgcgaagaaa 960 
acagctggag cagacgctgc taaatctgtt gaacctggtg atattgaggc gtcgctaagc 1020 
gcttcagcgg ccgaggagtc ccagtaa 1047 

<210> 34 
<211> 19 ' 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Consensus/Streptococcus pyogenes 
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<220> 

<223> X in position 12 can either be a S/T. 
<220> 

<223> X in position 18 can either be a R/K. 
<400> 34 

Thr Leu Leu Thr Cys Thr Pro Tyr Met lie Asn Xaa His Arg Leu Leu 
15 10 15 

Val Xaa Gly 



<210> 35 
<211> 19 
<212> PRT 

<213> Corynebacterium diphtheriae 

<400> 35 

.* "* 

Thr Leu Val Thr Cys Thr Pro Tyr Gly lie Asn Thr His Arg Leu Leu 
15 10 15 

Val Thr Ala 



<210> 36 

<211> 19 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 36 

Thr Leu Val Thr Cys Thr Pro Tyr Gly Val Asn Thr Lys Arg Leu Leu 
1 5 ' 10 15 

Val Arg Gly 



<210> 37 
<211> 150 

<212> PRT 

<213> Streptococcus pyogenes 



<400> 37 

lie Glu Asn Asn Asp lie Met Gly Tyr Val Glu Val Pro Ser lie Lys 
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1 5 

Val Thr Leu Pro lie Tyr 
20 

Gly Ala Gly His Leu Phe 
35 

Thr His Thr Val He Ser 
50 

Phe Thr Asn Leu Asn Leu 
65 70 

Val Leu Asn Lys Val Leu 
85 

Glu Pro Asp Gin Val Thr 
100 

Ala Thr Leu Val Thr Cys 
115 

Leu Val Arg Gly His Arg 
130 



10 

His Tyr Thr Thr Asp Glu 
25 

Gly Ser Ala Leu Pro Val 
40 

Ala His Arg Gly Leu Pro 
55 60 

Val Lys Lys Gly Asp Thr 
75 

Ala Tyr Lys Val Asp Gin 
90 

Ser Leu Ser Gly Val Met 
105 

Thr Pro Tyr Gly Val Asn 
120 

He Ala Tyr His Tyr Lys 
135 140 



PCT/USOl/24795 



15 

Val Leu Thr Lys 
30 

Gly Gly Asp Gly 
45 

Ser Ala Glu Met 



Phe Tyr Phe Arg 
80 

He Leu Thr Val 
95 

Gly Lys Asp Tyr 
110 

Thr Lys Arg Leu 
125 

Lys Tyr Gin Gin 



Ala Lys Lys Ala Met Lys 
145 150 
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calis and C. diptheriae are also provided. Antibodies to the Ema polypeptides and immunogenic fragments thereof are also provided. 
The present invention relates to the identification and prevention of infections by virulent forms of streptococci- This invention pro- 
vides pharmaceutical compositions, immunogenic compositions, vaccines, and diagnostic and therapeutic methods of use of the 
isolated polypeptides, antibodies thereto, and nucleic acids. Assays for compounds which modulate the polypeptides of the present 
invention for use in therapy are also provided. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1, 2, 25, 26 and patially 11-24, 35-51 

Streptococcal polypeptide EmaA comprising SEQ ID N0:2, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID N0:1 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



2. Claims: 3, 4, 27, 28 and partially 11-24, 35-51 

Streptococcal polypeptide EmaB comprising SEQ ID N0:4, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID NO: 3 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



3. Claims: 5, 5, 29, 30 and partially 11-24, 35-51 

Streptococcal polypeptide EmaC comprising SEQ ID NO: 6, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID NO: 5 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



4. Claims: 7, 8, 31, 32 and partially 11-24, 35-51 

Streptococcal polypeptide EmaD comprising SEQ ID NO: 8, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID N0:7 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 
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5. Claims: 9, 10, 33, 34 and partially 11-24, 35-51 

Streptococcal polypeptide EmaE comprising SEQ ID NO: 10, 
analogs, variants and fragments thereof; vaccine, and 
pharmaceutical compositions comprising the same; antibody 
and pharmaceutical composition thereof and cell line 
producing the antibody; nucleic acid of SEQ ID NO: 9 encoding 
the polypeptide and variants thereof; vector and host cell 
comprising it and nucleic acid vaccine; use of nucleic acid 
and antibody in diagnosis and use of the pharmaceutical 
compositions for prevention and treatment of infection. 



6. Claims: 52-54 

Streptococcal Ema polypeptide comprising SEQ ID NO: 23 and 
nucleic acid encoding it 



7. Claims: 55-57 

Streptococcal Ema polypeptide comprising SEQ ID NO: 25 and 
nucleic acid encoding it. 



8. Claims: 58, 59 

Streptococcal Ema polypeptide comprising SEQ ID NO: 37 and 
nucleic acid encoding it. 



9. Claims: 60-62 

Enterococcal Ema polypeptide compriding SEQ ID N0:29 and 
nucleic acid encoding it. 



10. Claims: 63-65 

Corynebacteri um Ema polypeptide and nucleic acid encoding it. 



11. Claims: 66, 67 and partially 71 

Polypeptide comprising SEQ ID NO: 34 



12. Claim : 68 and partially 71 

Polypeptide comprising SEQ ID NO: 35 



13. Claims: 69, 70 and partially 71 

Polypeptide comprising SEQ ID NO: 36 



page 2 of 3 



Internationaf Application No. PCT/US 01/24795 

FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



page 3 of 3 



INTFRNATIONAL SEARCH REPORT 

Information on patent family members 


International Application No 

PCT/US 01/24795 


Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 



wo 0012132 



09-03-2000 



AU 5696699 A 

CA 2341177 Al 

EP 1121149 Al 

JP 2002523474 T 

NO 20010981 A 

US 2002159997 Al 

WO 0012132 Al 



21-03-2000 
09-03-2000 

08- 08-2001 

30- 07-2002 
26-04-2001 

31- 10-2002 

09- 03-2000 



WO 9838312 A 03-09-1998 AU 2192497 A 18-09-1998 

CA 2247072 Al 03-09-1998 

EP 0942982 Al 22-09-1999 

JP 2001505061 T 17-04-2001 

WO 9838312 Al 03-09-1998 



Form PCT/lSA/210 (patent family annex) (July 1992) 



